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USE OF HEREGULIN AS AN EPITHELIAL CELL GROWTH FACTOR 

BACKGROUND OF THE INVENTION 
5 Field of the Invention 

This invention relates to the use of HER2. HER3 and/or HER4 ligands. in particular heregulin 
polypeptides, as epithelial cell growth factors. 

Description of Background and Related Art 
The HER (ErbB) family belongs to the subclass I receptor tyrosine kinase superfamily and consists 

10 of three distinct receptors, HER2, HER3, and HER4. A ligand for this ErbB family is the protein heregulin 
(HRG), a multidomain containing protein with at least 15 distinct isoforms. 

Transduction of signals that regulate cell growth and differentiation is regulated in part by 
phosphorylation of various cellular proteins. Protein tyrosine kinases are enzymes that catalyze this process. 
Receptor protein tyrosine kinases are believed to direct cellular growth via ligand-stimuiated tyrosine 

15 phosphorylation of intracellular substrates. .Growth factor receptor protein tyrosine kinases of the class I 
subfamily include the 170 kDa epidermal growth factor receptor (EGFR) encoded by the erbB\ gene. er^Bl 
has been causally implicated in human malignancy. In particular, increased expression of this gene has been 
observed in more aggressive carcinomas of the breast, bladder, lung and stomach. 

The second member of the class I subfamily, pI85 ncu was originally identified as the product of the 

20 transforming gene from neuroblastomas of chemically treated rats. The next gene (also called erbBl and 
HER2) encodes a 185 kDa receptor protein tyrosine kinase. Amplification and/or overexpression of the 
human HER2 gene correlates with a poor prognosis in breast and ovarian cancers (Slamon et al.. Science 
235:177-182 (1987); and Slamon et al.. Science 244:707-712 (1989)). Overexpression of HER2 has been 
correlated with other carcinomas including carcinomas of the stomach, endometrium, salivary gland, lung, 

25 kidney, colon and bladder. Accordingly, Slamon et al. in US Pat No. 4,968,603 describe and claim various 
diagnostic assays for determining HER2 gene amplification or expression in tumor cells. Slamon et al. 
discovered that the presence of multiple gene copies of HER2 oncogene in tumor cells indicates that the 
disease is more likely to spread beyond the primary rumor site, and that the disease may therefore require 
more aggressive treatment than might otherwise be indicated by other diagnostic factors. Slamon et al. 

30 conclude that the HER2 gene amplification test, together with the determination of lymph node status, 
provides greatly improved prognostic utility. 

A further related gene, called erbB3 or HER3, has also been described. See US Pat. No. 5,183,884; 
Kraus et al., Proc. Natl. Acad. Sci. USA 86:9193-9197 (1989); EP Pat Appln No 444,961A1; and Kraus et 
al., Proc. Natl. Acad. Sci. USA 90:2900-2904 (1993). Kraus et al. (1989) discovered that markedly elevated 

35 levels of erbB3 mRNA were present in certain human mammary tumor cell lines indicating that erbB3, like 
er6Bl and erbBl, may play a role in human malignancies. Also, Kraus et al. (1993) showed that EGF- 
dependent activation of the ErbB3 catalytic domain of a chimeric EGFR/ErbB3 receptor resulted in a 
proliferative response in transfected NIH-3T3 cells. This is now believed to be the result of endogenous 
ErbBl or ErbB2 in NIH-3T3. Furthermore, these researchers demonstrated that some human mammary 

40 tumor cell lines display a significant elevation of steady-state ErbB3 tyrosine phosphorylation further 
indicating that this receptor may play a role in human malignancies. The role of erbB3 in cancer has been 
explored by others. It has been found to be overexpressed in breast (Lemoine et al., Br. J. Cancer 66:1 1 16- 
1121 (1992)). gastrointestinal (Poller et al.. J. Pathol 168:275-280 (1992). Rajkumer et al., J. Pathol. 
170:271-278 (1993), and Sanidas et aL Int. J. Cancer 54:935-940 (1993)), and pancreatic cancers (Lemoine 

45 etai, J. Pathol. 168:269-273 (1992), and Friess et al., Clinical Cancer Research 1:1413-1420 (1995)). 

The class I subfamily of growth factor receptor protein tyrosine kinases has been further extended to 

1 
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include the HER4/Erb4 receptor See EP Pat Appln No 599.274; Plowman et at.. Proc. Natl. Acad. Sci USA 
90: 1746-1750 (1993); and Plowman et al.. Nature 366:473-475 (1993). Plowman et al. found that increased 
HER4 expression closely correlated with certain carcinomas of epithelial origin, including breast 
adenocarcinomas. Diagnostic methods for detection of human neoplastic conditions (especially breast 
5 cancers) which evaluate HER4 expression are described in EP Pat Appln No. 599.274. 

The quest for the activator of the HER2 oncogene has lead to the discovery of a family of heregulin 
polypeptides. These proteins appear to result from alternate splicing of a single gene which was mapped to 
the short arm of human chromosome 8 by Orr-Urtreger et al., Proc. Natl Acad. Set. USA 90:1867- 
1871(1993). See also Lee and Wood, Genomics, 16:790-791 (1993). 
10 Holmes et al. isolated and cloned a family of polypeptide activators for the HER2 receptor which 

they called heregulin-cca (HRG-a), heregulin-pi (HRG-Pl), heregulin-p2 (HRG-P2), heregulin-p2-like 
(HRG-p2-like), and heregulin-p3 (HRG-p3). See Holmes et at.. Science 256:1205-1210 (1992); WO 
92/20798; and U.S. Patent 5,367,060. The 45 kDa polypeptide, HRG-a, was purified from the conditioned 
medium of the MDA-MB-23I human breast cancer cell line. These researchers demonstrated the ability of 

15 the purified heregulin polypeptides to activate tyrosine phosphorylation of the HER2 receptor in MCF7 
breast tumor cells. Furthermore, the mitogenic activity of the heregulin polypeptides on SK-BR-3 cells 
(which express high levels of the HER2 receptor) was illustrated. Like other growth factors which belong to 
the EGF family, soluble HRG polypeptides appear to be derived from a membrane bound precursor (called 
pro-HRG) which is proteolytically processed to release the 45kDa soluble form. These pro-HRGs lack a N- 

20 terminal signal peptide. 

While heregulins are substantially identical in the first 213 amino acid residues, they are classified 

into two major types, a and p, based on two variant EGF-like domains which differ in their C-terminal 
portions. Nevertheless, these EGF-like domains are identical in the spacing of six cysteine residues contained 
therein. Based on an amino acid sequence comparison, Holmes et al. found that between the first and sixth 
" ' V 'cysteines in the EGF-like domain, HRGs were 45% similar to heparin-binding EGF-like growth factor (HB- 
EGF), 35% identical to amphireguiin (AR), 32% identical to TGF-oc, and 27% identical to EGF. 

The 44 kDa neu differentiation factor (NDF), which is the rat equivalent of human HRG, was first 
described by Pefes et at.. Cell, 69:205-216 (1992); and Wen et al.. Celt, 69:559-572 (1992). Like the HRG 
polypeptides, NDF has an immunoglobulin (Ig) homology domain followed by an EGF-like domain and 

30 lacks a N -terminal signal peptide. Subsequently, Wen et at., Mot. Cell. Biol. , 14(3): 1909-1919 (1994) carried 
out "exhaustive cloning" to extend the family of NDFs. This work revealed six distinct fibroblastic pro- 
NDFs. Adopting the nomenclature of Holmes et at., the NDFs are classified as either a or p polypeptides 
based on the sequences of the EGF-like domains, lsoforms 1 to 4 are characterized on the basis of the 
variable justamembrane stretch (between the EGF-like domain and transmembrane domain). Also, isoforms 

35 a, b and c arc described which have variable length cytoplasmic domains. These researchers conclude that 
different NDF isoforms are generated by alternative splicing and perform distinct tissue-specific functions. 
See also EP 505 148; WO 93/22424; and WO 94/28133 concerning NDF. 

Falls et al. Cell, 72:801-815 (1993) describe another member of the heregulin family which they 
call acetylcholine receptor inducing activity (ARJA) polypeptide. The chicken-derived ARIA polypeptide 

40 stimulates synthesis of muscle acetylcholine receptors. See also WO 94/08007. ARIA is a P-type heregulin 
and lacks the entire spacer region rich in glycosylation sites between the Ig-like domain and EGF-like 

domain of HRGa, and HRGpl-p3. 

Marchionni et al., Nature, 362:312-318 (1993) identified several bovine-derived proteins which 
they call glial growth factors (GGFs). These GGFs share the Ig-like domain and EGF-like domain with the 
45 other heregulin proteins described above, but also have an amino-terminal kringle domain. GGFs generally 
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do not have the complete glycosylated spacer region between the Ig-like domain and EGF-like domain. 

Only one of the GGFs, GGFII, possessed a N-terminal signal peptide. See also WO 92/18627; WO 

94/00140; WO 94/04560; WO 94/26298; and WO 95/32724 which refer to GGFs and uses thereof. 

Ho et ai in J. Biol. Chem. 270(4): 14523-14532 (1995) describe another member of the heregulin 

5 family called sensory and motor neuron-derived factor (SMDF). This protein has an EGF-like domain 

characteristic of all other heregulin polypeptides but a distinct N-terminal domain. The major structural 

difference berween SMDF and the other heregulin polypeptides is the lack in SMDF of the Ig-like domain 

and the "glyco" spacer characteristic of all the other heregulin polypeptides. Another feature of SMDF is the 

presence of two stretches of hydrophobic amino acids near the N-terminus. 

10 While the heregulin polypeptides were first identified based on their ability to activate the HER2 

receptor (see Holmes et ai. supra), it was discovered that certain ovarian cells expressing neu and neu- 

transfected fibroblasts did not bind or crosslink to NDF, nor did they respond to NDF to undergo tyrosine 

phosphorylation (Peles et ai, EMBO J. 12:961-971 (1993)). This indicated another cellular component was 

125 

necessary for conferring full heregulin responsiveness. Carraway et ai. subsequently demonstrated that ~ I- 
15 rHRGpi 177 _244 DOUnd t0 N1H-3T3 fibroblasts stably transfected with bovine erbB3 but not to non- 
transfected parental cells. Accordingly, they conclude that ErbB3 is a receptor for HRG and mediates 
phosphorylation of intrinsic tyrosine residues as well as phosphorylation of ErbB2 receptor in cells which 
express both receptors. Carraway et ai. J. Biol. Chem. 269(19). 14303-14306 (1994). Sliwkowski et ai. J. 
Biol. Chem. 269(20): 14661-14665 (1994) found that cells transfected with HER3 alone show low affinities 
20 for heregulin, whereas cells transfected with both HER2 and HER3 show higher affinities. 

This observation correlates with the "receptor cross-talking" described previously by Kokai et at.. 
Cell 58:287-292 (1989); Stern et ai, EMBO J. 7:995-1001 (1988); and King et ai, 4. 13-18 (1989). These 
researchers found that binding of EGF to the EGFR resulted in activation of the EGFR kinase domain and 
cross-phosphorylation of pi 85^. This is believed to be a result of ligand-induced receptor 
25 heterodimerization and the concomitant cross-phosphorylation of the receptors within the heterodimer (Wada 
et al. , Celt 61:1 339- 1 347 ( 1 990)). 

HER4 HFP ? 

Plowman and his colleagues have similarly studied p 1 85 /pi 85 activation. They expressed 
pl85 HER2 alone, pl85 HER4 alone, or the two receptors together in human T lymphocytes and demonstrated 
that heregulin is capable of stimulating tyrosine phosphorylation of pl85 HER4 , but could only stimulate 

30 pl85 HER2 phosphorylation in cells expressing both receptors. Plowman et ai. Nature 336:473-475 (1993). 

The biological role of heregulin has been investigated by several groups. For example, Falls et ai, 
(discussed above) found that ARIA plays a role in myotube differentiation, namely affecting the synthesis 
and concentration of neurotransmitter receptors in the postsynaptic muscle cells of motor neurons. Corfas 
and Fischbach demonstrated that ARIA also increases the number of sodium channels in chick muscle. 

35 Corfas and Fischbach, J. Neuroscience, 13(5): 2118-2125 (1993). It has also been shown that GGFII is 
mitogenic for subconfluent quiescent human myoblasts and that differentiation of clonal human myoblasts in 
the continuous presence of GGFII results in greater numbers of myotubes after six days of differentiation 
(Sklar et ai, J. Cell Biochem.. Abst. W462, I8D, 540 (1994)). See also WO 94/26298 published November 
24, 1994. 

40 Holmes et ai. supra, found that HRG exerted a mitogenic effect on mammary cell lines (such as 

SK-BR-3 and MCF-7). The mitogenic activity of GGFs on Schwann cells has also been reported. See, e.g., 
Brockes et ai. J. Bioi Chem. 255(1 8):8374-8377 (1980); Lemke and Brockes, J. NeuroscL 4:75-83 (1984 
Lemke and Brockes, J. Neurosci. 4:75-83 (1984); Brockes et ai, Ann. Neuroi 20(3):3 17-322 (1986); 
Brockes. J., Methods in Enzym.. 147: 217-225 (1987) and Marchionni et ai. supra. Schwann cells constitute 
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important glial cells which provide myelin sheathing around the axons of neurons, thereby forming 
individual nerve fibers. Thus, it is apparent that Schwann cells play an important role in the development, 
function and regeneration of peripheral nerves. The implications of this from a therapeutic standpoint have 
been addressed by Levi et ai, J Neuroscience 14(3): 1309- 13 19 (1994). Levi et ai discuss the potential for 
5 construction of a cellular prosthesis comprising human Schwann cells which could be transplanted into areas 
of damaged spinal cord. Methods for culturing Schwann cells ex vivo have been described. See WO 
94/00140 and Li etai.J. Neuroscience 1 6(6):20 12-20 19 ( 1996). 

PinJkas-Kj-amarski et al. found that NDF seems to be expressed in neurons and glial cells in 
embryonic and adult rat brain and primary cultures of rat brain ceils, and suggested that it may act as a 

10 survival and maturation factor for astrocytes (Pinkas-Kramarski et ai, PNAS, USA 91:9387-9391 (1994)). 
Meyer and Birchmeier, PNAS. USA 91:1064-1068 (1994) analyzed expression of heregulin during mouse 
embryogenesis and in the perinatal animal using in situ hybridization and RNase protection experiments. See 
also Meyer et ai.. Development 124(1 8):3575-3586 (1997). These authors conclude that, based on expression 
of this molecule, heregulin plays a role in vivo as a mesenchymal and neuronal factor. Similarly, Danilenko 

15 et ai. Abstract 3101, FASEB 8(4-5):A535 (1994); Danilenko et al.. Journal of Clinical Investigation 95(2): 
842-851 (1995), found that the interaction of NDF and the HER2 receptor is important in directing epidermal 
migration and differentiation during wound repair. 

Ram et aL< Journal of Cellular Physiology 163:589-596 (1995) evaluated the mitogenic activity of 
NDF on the immortalized human mammary epithelial cell line MCF-10A. Danilenko et ai, J. Clin. Invest. 

20 95:842-851 (1995) investigated whether NDF would influence epidermal migration in an in vivo model of 
excisional deep partial-thickness wound repair. It is reported that there were no statistically significant 
differences in proliferating basal and superbasal keratinocytes in control wounds vs. wounds treated with 
rhNDF-a 2 . Marikovsky et ai t Oncogene 10:1403-141 1 (1995), studied the proliferative responses of an 
aneupioid BALB/MK continuous keratinocyte cell line and evaluated the effects of a- and {J-isoforms of 

25 NDF on epidermal keratinocytes. 

' The relationship between the structure and function of new proteins can be investigated using any of 

a variety of available mutational analysis techniques. Examples of such techniques include alanine scanning 
mutagenesis and phagemid display. Alanine scanning can be used to identify active residues (i.e., residues 
that have a significant effect on protein function) in a protein or protein domain. For example, Cunningham 

30 and Wells used alanine scanning to identify residues in human growth hormone that were important for 
binding its receptor. Cunningham and Wells. Science 244:1081-1085 (1989). In alanine scanning, a gene 
encoding the protein or domain to be scanned is inserted into an expression vector, and mutagenesis is 
carried out to generate a series of vectors that encode proteins or domains in which sequential residues are 
converted to alanine. The encoded proteins or domain are expressed from these vectors, and the activities of 

35 the alanine-substituted variants are then tested to identify those with altered activity. An alteration in activity 
indicates that the residue at the alanine-substituted position is an active residue. 

Phagemid display was developed to allow the screening of a large number of variant polypeptides 
for a particular binding activity. Smith and Parmley demonstrated that foreign peptides can be "displayed" 
efficiently on the surface of filamentous phage by inserting short gene fragments into gene II! of the fd 

40 phage. Smith, Science 228:1315-1317 (1985); Parmley and Smith, Gene 73:305-318 (1985). The gene III 
coat protein is present in about five copies at one end of the phage particle. The modified phage were termed 
"fusion phage" because they displayed the foreign peptides fused to the gene II! coat protein. As each fusion 
phage particle displayed approximately five copies of the fusion protein, this mode of phage display was 
termed "polyvalent display 

45 Scon et al. and Cwirla et al. showed that fusion phage libraries could be screened by sequential 

affinity selections known as "panning." Scon et al.. Science 249:386-390 (1990); Cwirla et a/.. PNAS USA 
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87:6378-6382 (1990). However, early efforts to select high affinity fusion phage failed, presumably due to 
the polyvalence of the phage particles. This problem was solved with the development of a "monovalent" 
phage display system in which the fusion protein is expressed at a low level from a phagemid and a helper 
phage provides a large excess of wild-type coat protein. Bass et a!.. Proteins 8:309-3 14 (1990): Lowman et 
at., Biochem. 30: 10832-10838 ( 1991 ). Monovalent phage display can be used to generate and screen a large 
number of variant polypeptides to isolate those that bind with high affinity to a target of interest. 

Approximately 50,000 infants are bom in the United States every year with birth weights, less than 
1.5 kg. About two thirds of these very low birth weight infants have evidence of pulmonary immaturity 
manifested as respiratory distress shortly after birth. The majority of these infants require mechanical 
ventilation. Respiratory distress syndrome, caused by insufficient pulmonary surfactant production, as well 
as structural immaturity of the lung, is responsible for respiratory difficulties observed in these prematurely 
born neonates. Well developed alveoli are necessary to provide efficient oxygen transfer from the air-liquid 
interface of the lung to the systemic circulation. Surfactant proteins are critical in reducing the alveolar 
surface tension at low lung volumes and preventing alveolar collapse. 

A need continues to exist for a method of treatment for respiratory distress syndrome and other 
diseases associated with immature lung development and low lung surfactant production. 

SUMMARY OF THE INVENTION 

In general an object of the invention is to provide a method of inducing epithelial cell growth and 
development for the purpose of promoting repair and healing of tissue damage or injury. 

Accordingly, one object of this invention is to provide a method of treating respiratory distress 
svnclromc in patients, primarily human patients, in need of such treatment. A further object is to provide a 
metnod of inducing lung epithelial cell growth and development. A further object is to provide a method of 
incrc^smy lung surfactant protein A production in the lung of persons with impaired oxygen transfer in the 
luny alveoli. This invention is useful in treating infants/neonates with respiratory distress as well as youth 
and ddults with poor lung function due to lung injury or damage. 

In one aspect of this invention, it has now been discovered that these objects and the broader 
objective of treating conditions associated with epithelial cell damage and injury are achieved by 
administering to a patient in need of such treatment an effective amount of a heregulin ligand, preferably a 
polypeptide or fragment thereof. These heregulin (HRG) polypeptides, include HRG-oc, HRG-PK HRG-P2, 
HRG-[33 and other HRG polypeptides which cross-react with antibodies directed against these family 
members and/or which are substantially homologous as defined below and includes HRG variants such as N- 
terminal and C-terminal fragments thereof. A preferred HRG is the ligand disclosed in Fig. 1A - ID and 
further designated HRG-a. Other preferred HRGs are the ligands disclosed in Figure 2A - 2E, and 
designated HRG-pl; disclosed in Figure 3A - 3E designated HRG-p2: and disclosed in Figure 4A - 4C 
designated HRG-p3. 

In another aspect, the invention provides a method in which HRG agonist antibodies are 
administered to achieve the objects of the invention. In this embodiment, HER2/HER3 or fragments thereof 
(which also may be synthesized by in vitro methods) are fused (by recombinant expression or an in vitro 
peptidyl bond) to an immunogenic polypeptide and this fusion polypeptide, in turn, is used to raise 
antibodies against a HER2/HER3 epitope. Agonist antibodies are recovered from the serum of immunized 
animals. Alternatively, monoclonal antibodies are prepared from in vitro cells or in vivo immunized animals 
in conventional fashion. If desired, the agonist antibodies may be obtained by phage display selection from a 
phage library of antibodies or antibody fragments. Preferred antibodies identified by routine screening will 
bind to the receptor, but will not substantially cross-react with any other known ligands such as EGF, and 
will activate the HER receptors HER2, HER3 and/or HER4. in addition, antibodies may be selected that are 
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capable of binding specifically to individual family members of HRG family, e.g. HRG-ce. HRG-pl, HRG- 

p2, HRG-p3, and which are agonists thereof. 

In general, the invention is a method of regenerating and/or repairing epithelial cell injury by 
stimulating growth and proliferation of epithelial cells, in particular ductal and ciliated epithelial cells. The 
5 epithelial cells may be injured by many rypes of insults, for example, injury due to surgical incision or 
■ resection, chemical or smoke inhalation or aspiration, chemical or biochemical ulceration, cell damage due to 

viral or bacterial infection, etc. The epithelial cells which may be affected by the method of the invention 
include any epithelial cell which expresses HER2, HER3 and/or HER4; suitable cells are located, for 
example, in the lung, gastric mucosa, endometrium, oviducts, mammary glands, pancreas, salivary glands, 
10 etc. The method of the invention stimulates growth and proliferation of the epithelial cells, repairing and re- 
; establishing the cellular barriers of organs and allowing the affected tissues to develop normal physiological 

functions more quickly. For example, lung epithelial cells are damaged by inhalation of smoke resulting in 
j emphysema. Treatment of the lung cells by the method of the invention regenerates the barrier layer of lung 

! epithelial cells, improves oxygenation and speeds the development of a barrier to infection. Similarly, cell 

; 15 damage due to aspiration of gastric acid can be treated by the method of the invention to facilitate 

regeneration of epithelial cells. 

Accordingly, one embodiment of the invention is a method of inducing lung epithelial cell growth 
and development by contacting a lung epithelial cell which expresses HER2, HER3 and/or HER4 receptors 
with an effective amount of a HER2, HER3 and/or HER4 activating ligand. 
20 Another embodiment is a method of increasing lung surfactant protein A in a patient by 

administering to a patient in need thereof an effective amount of a HER2, HER3 and/or HER4 activating 
ligand. 

A further embodiment is a method of treating respiratory distress by administering to a patient in 
need thereof an effective amount of a HER2, HER3 and/or HER4 activating ligand. 
25 A further embodiment is a method of treating emphysema by administering to a patient in need 

^3r^:^-<MC'"*^^--»-'t^c r eof an effective amount of a HER2, HER3 and/or HER4 activating ligand. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A - ID show the deduced amino acid sequence (SEQ ID NO: I) for the cDNA sequence 
(SEQ ID NO:2) contained in a clone obtained according to U.S. 5,367,060. The initiating methionine (Met) 

30 of HRG-ot is at position 45. 

Figure 2A - 2E show the deduced amino acid sequence (SEQ ID NO:3) and cDNA sequence (SEQ 
ID NO:4) of a potential coding sequence of a clone obtained according to U.S. 5,367,060 for HRG-Pl. The 
initiating Met is at M31. 

Figure 3A - 3E show the deduced amino acid sequence (SEQ ID NO:5) and cDNA sequence (SEQ 
35 ID NO:6) of a nucleotide sequence of a clone obtained according to U.S. 5,367,060 for HRG-p2. 

Figure 4A - 4C show the deduced amino acid sequence (SEQ ID NO:7) and cDNA sequence (SEQ 
ID NO:8) of a nucleotide sequence of a clone obtained according to U.S. 5,367,060 for HRG-p3. 

Figure 5A - 5D show the deduced amino acid sequence (SEQ ID NO:9) and cDNA sequence (SEQ 
ID NO: 10) of a nucleotide sequence of a clone obtained according to U.S. 5,367,060 for HRG-P2-like 
40 protein. 

Figure 6A - 6C show a comparison of the amino acid homologies of several known heregulins a, 
Pi, p2, p2-like and p3 in descending order and illustrates the amino acid insertions, deletions, and 
substitutions that characterize these forms of HRG (SEQ ID NOS: !, 3, 5, 9, and 7). 

Figure 7A - 7C show the deduced amino acid sequence (SEQ ID NO: 1 1) and cDNA sequence (SEQ 
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ID N0:12) of v-HRG obtained as described in U.S. Serial No. 08/891.845. The hydrophobic region is 
underlined. The EGF-tike domain is shaded, cysteine residues in the EGF-like domain are circled. N-linked 
glycosylation sites are marked above the nucleic acid sequence with a ( ). 

Figure 8 shows the cDNA sequence (SEQ ID NO: 13) and amino acid sequence (SEQ ID NO: 14) of 
SMDF obtained as described in U.S. Serial No. 08/339,517. An EGF-like domain and the apolar and 
uncharged domains (i.e. "apolar I" consisting of residues from about 48-62 and "apolar II" consisting of 
residues from about 76-100) are underlined. Cysteines in the EGF-like domain and in the "cysteine knot" in 
the unique N-terminal domain ("NTD-cys knot") are boxed. The stop codon is denoted by the letter "O". 
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

HRG iigands, in particular polypeptides and agonist antibodies thereof, have affinity for and 
stimulate the HER2, HER3 and/or HER4 receptors or combinations thereof in autophosphorylation. 
Included within the definition of HRG Iigands, in addition to HRG-a, HRG-Pl, HRG-p2, HRG-P3 and 
HRG-P2-like. are other polypeptides binding to the HER2, HER3 and/or HER4 receptor, which bear 
substantial amino acid sequence homology to HRG-ct or HRG-Pl. Such additional polypeptides fall within 
the definition of HRG as a family of polypeptide iigands that bind to the HER2. HER3 and/or HER4 
receptors. 

Hereguiin polypeptides bind with varying affinities to the HER2. HER3 and/or HER4 receptors. 
Generally, the HER3 and HER4 receptors are bound with high affinity. It is also known that 
heterodimerization of HER2 with HER3 and of HER2 with HER4 occurs with subsequent receptor cross- 
phosphorylation as described above. In the present invention, epithelial cell growth and/or proliferation is 
induced when a hereguiin protein interacts and binds with an individual receptor molecule or a receptor 
dimer such that receptor phosphorylation is induced. Binding and activation of HER2, HER3, HER4 or 
combinations thereof, therefore, is meant to include activation of any form of the receptor necessary for 
receptor activation and biologic function including monomeric receptor and dimeric receptor forms. Dimeric 
receptor forms may be referred to below, for example, as HER2/HER3, HER2/HER4. and HER3/HER4. 
I. DEFINITIONS 

In general, the following words or phrases have the indicated definition when used in the 
description, examples, and claims. 

"Hereguiin" (HRG) ligand is defined herein to be any isolated ligand, preferably a polypeptide 
sequence which possesses a biological property of a naturally occurring HRG polypeptide. Ligands within 
the scope of this invention include the NDF. ARIA and GGF growth factor hereguiin proteins identified 
above as well as the SMDF and HRG polypeptides discussed in detail herein. These isolated NDF, ARIA 
and GGF hereguiin polypeptides are well known in the art. HRG includes the polypeptides shown in Figs. 
1A-ID, 2A-2E, 3A-3E. 4A-4C, 5A-5D, 6A-6C, 7A-7C and 8 and mammalian analogues thereof. Included 
are HRG variants such as the y-HRG described in WO 98/02541, published 22 January 1998: the variants 
described in WO 98/35036, published 13 August 1998; and the SMDF variants described in US Patent NO. 
5,770,567 granted 23 June 1998 (WO 96/15244). These applications are incorporated herein in their 
entirety. These variants can be prepared by the methods described below, optionally together with alanine 
scanning and phage display techniques known in the art. Cunningham and Wells. Science 244:1081-85 
(1989); Bass et aU Proteins 8:309-14 (1990); Lowman et aL, Biochem. 30: 10832-38 (1991 ). 

The term a "normal" epithelial cell means an epithelial cell which is not transformed, i.e., is non- 
cancerous and/or non-immortalized. Further, the normal epithelial cell is preferably not aneuploid. 
Aneuploidy exists when the nucleus of a cell does not contain an exact multiple of the haploid number of 
chromosomes, one or more chromosomes being present in greater or lesser number than the rest. Typical 
properties of transformed cells which fall outside the scope of this invention include the ability to form 
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tumors when implanted into immune-deprived mice (nude mice), the ability to grow in suspension or in 
semi-solid media such as agar, a loss of contact inhibition allowing piling up of cells into colonies or foci, a 
ioss of dependence on growth factors or serum, cell death if cells are inhibited from growing, and 
disorganization of actin filaments. Specifically included within the invention are normal epithelial cells 
5 which will not form tumors in mice, grow attached to plastic or glass (are anchorage dependent), exhibit 
contact inhibition, require serum-containing hormones and growth factors, remain viable if growth is arrested 
by lack of serum, and contain well-organized actin filaments. Although the normal epithelial cells are 
preferably not cultured cells, also suitable for the invention are non -transformed, non-immortalized 
epithelial cells isolated from mammalian tissue. These isolated cells may be cultured for several generations 
10 (up to about 10 or even 50 generations) in the presence of a heregulin in order to induce growth and/or 
proliferation of the isolated epithelial cell sample, that is, to expand the sample. The expanded sample can 
then be reintroduced into the mammal for the purpose of repopulating the epithelial cell tissue (re- 
epithelialization). This is particularly useful for repairing tissue injury or damage. 

An "epithelial" cell is a cell located in a cellular, avascular layer covering the free surface 
I 5 (cutaneous, mucous or serous) of an organ or lining a rube or cavity of an animal body. Lung epithelial cells 
include bronchial epithelial cells, Type II cells and Clara cells. The term "epithelial cell" as used herein is 
consistent with the an recognized definition of epithelial cells in epithelium. See. for example, the definition 
in Taber's Encyclopedic Medical Dictionary, Edition 12, (1973) F.A. Davis Company, publisher. 

"Biological property" for the purposes herein means an in vivo biologic or antigenic function or 
?»• acii\it> that is directly or indirectly performed by an HRG sequence (whether in its native or denatured 
tonnirmation), or by any subsequence thereof. Biologic functions include receptor binding, any enzyme 
acii\it\ or enzyme modulatory activity, any carrier binding activity, any hormonal activity, any activity in 
promoimt: or inhibiting adhesion of cells to extracellular matrix or cell surface molecules, or any structural 
role However, biologic functions do not include antigenic functions, i.e. possession of an epitope or 
2.^ antigenic site that is capable of cross-reacting with antibodies raised against a naturally occurring HRG 
Em.*?'.yv : * mo 6s?r-- > ^pd \ y pc pt i d e . 

"Biologically active" HRG is defined herein as a polypeptide sharing a biologic function of an HRG 
sequence which may (but need not) in addition possess an antigenic function. A principal known effect r.- 
funcnon of HRG is as a iigand polypeptide having a qualitative biological activity of binding to HER2, 
30 HER3 and/or HER4 resulting in the activation of the receptor tyrosine kinase (an "activating ligand"). One 
test lor activating iigands is the HRG tyrosine autophosphorylation assay described below. Included within 
the scope of HRG as that term is used herein are HRG having translated mature amino acid sequences of the 
complete human HRG as set forth herein; deglycosylated or unglycosylated derivatives of HRG, amino acid 
sequence variants of HRG sequence, and derivatives of HRG, which are capable of exhibiting a biological 
35 property in common with HRG. While native HRG is a membrane-bound polypeptide, soluble forms, such 
as those forms lacking a functional transmembrane domain, are also included within this definition. In 
particular, included are polypeptide fragments of HRG sequence which have an N-terminus at any residue 
from about S216 to about A227, and its C-terminus at any residue from about K268 to about R286, and the 
homologous sequences shown in Fig. 6A-C, hereinafter referred to collectively for all HRGs as the growth 
40 factor domain (GFD). 

"Antigenically active" HRG is defined as a polypeptide that possesses an antigenic function of an 
HRG and which may (but need not) in addition possess a biologic function. 

In preferred embodiments, antigenically active HRG is a polypeptide that binds with an affinity of 
at least about 10"^ l/mole to an antibody raised against a naturally occurring HRG sequence. Ordinarily the 

45 polypeptide binds with an affinity of at least about 10"** I/mole. Most preferably, the antigenically active 
HRG is a polypeptide that binds to an antibody raised against one of HRGs in its native conformation. HRG 
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in its native conformation generally is HRG as found in nature which has not been denatured by chaotropic 
agents, heat or other treatment that substantially modifies the three dimensional structure of HRG as 
determined, for example, by migration on nonreducing, nondenaturing sizing gels. Antibody used in this 
determination may be rabbit polyclonal antibody raised by formulating native HRG from a non-rabbit 
species in Freund's complete adjuvant, subcutaneously injecting the formulation, and boosting the immune 
response by intraperitoneal injection of the formulation until the titer of anti-HRG antibody plateaus. 

Ordinarily, biologically or antigenically active HRG will have an amino acid sequence having at 
least 75% amino acid sequence identity with a given HRG sequence, more preferably at least 80%, even 
more preferably at least 90%, and most preferably at least 95%. Identity or homology with respect to an 
HRG sequence is defined herein as the percentage of amino acid residues in the candidate sequence that are 
identical with HRG residues in the HRG of Fig. 6A-6C, after aligning the sequences and introducing gaps, if 
necessary, to achieve the maximum percent homology, and not considering any conservative substitutions as 
pan of the sequence identity. None of N-terminal. C-terminal or internal extensions, deletions, or insertions 
into HRG sequence shall be construed as affecting homology. 

Thus, the biologically active and antigenically active HRG polypeptides that are the subject of this 
invention include each entire HRG sequence; fragments thereof having a consecutive sequence of at least 5, 
10, 15, 20, 25. 30 or 40 amino acid residues from HRG sequence; amino acid sequence variants of HRG 
sequence wherein an amino acid residue has been inserted N- or C-terminal to, or within, HRG sequence or 
its fragment as defined above; amino acid sequence variants of HRG sequence or its fragment as defined 
above has been substituted by another residue. HRG polypeptides include those containing predetermined 
mutations by, e.g., site-directed or PCR mutagenesis, and other animal species of HRG polypeptides such as 
raooit. rat, porcine, non-human primate, equine, murine, and ovine HRG and alleles or other naturally 
occurring variants of the foregoing and human sequences; derivatives of HRG or its fragments as defined 
above wherein HRG or its fragments have been covalently modified by substitution, chemical, enzymatic, or 
other appropriate means with a moiety other than a naturally occurring amino acid (for example a detectable 
moiety such as an enzyme or radioisotope); giycosylation variants of HRG (insertion of a giycosylation site 
or deletion of any giycosylation site by deletion, insertion or substitution of appropriate amino acid); and 
soluble forms of HRG, such as HRG-GFD or those that lack a functional transmembrane domair 

"Isolated" means a ligand, such as HRG, which has been identified and separated and/or recovered 
from a component of its natural environment. Contaminant components of its natural environment are 
materials which would interfere with diagnostic or therapeutic uses for HRG, and may include proteins, 
hormones, and other substances. In preferred embodiments, HRG will be purified ( 1 ) to greater than 95% by 
weight of protein as determined by the Lowry method or other validated protein determination method, and 
most preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N- 
terminal or internal amino acid sequence by use of the best commercially available amino acid sequenator 
marketed on the filing date hereof, or (3) to homogeneity by SDS-PAGE using Coomassie blue or, 
preferably, silver stain. Isolated HRG includes HRG m situ within recombinant cells since at least one 
component of HRG natural environment will not be present. Isolated HRG includes HRG from one species 
in a recombinant cell culture of another species since HRG in such circumstances will be devoid of source 
polypeptides. Ordinarily, however, isolated HRG will be prepared by at least one purification step. 

In accordance with this invention, HRG nucleic acid is RNA or DNA containing greater than ten 
bases that encodes a biologically or antigenically active HRG, is complementary to nucleic acid sequence 
encoding such HRG, or hybridizes to nucleic acid sequence encoding such HRG and remains stably bound to 
it under stringent conditions. 

Preferably, HRG nucleic acid encodes a polypeptide sharing at least 75% sequence identity, more 
preferably at least 80%, still more preferably at least 85%, even more preferably at 90%, and most preferably 
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95%, with an HRG sequence. Preferably, the HRG nucleic acid that hybridizes contains at least 20, more 
preferably at least about 40, and most preferably at least about 90 bases. 

Isolated HRG nucleic acid includes a nucleic acid that is identified and separated from at least one 
containment nucleic acid with which it is ordinarily associated in the natural source of HRG nucleic acid. 
5 Isolated HRG nucleic acid thus is present in other than in the form or setting in which it is found in nature. 
However, isolated HRG encoding nucleic acid includes HRG nucleic acid in ordinarily HRG -expressing ceils 
where the nucleic acid is in a chromosomal location different from that of natural cells or is otherwise 
flanked by a different DNA sequence than that found in nature. Nucleic acid encoding HRG may be used in 
specific hybridization assays, particularly those portions of HRG encoding sequence that do not hybridize 
10 with other known DNA sequences. "Stringent conditions" are those that (I) employ low ionic strength and 
high temperature for washing, for example, 0.015 M NAC1/0.0015 M sodium citrate/0/1% NaDodS0 4 at 50° 

C; (2) employ during hybridization a denaturing agent such as formamide, for example. 50% (vol/vol) 

formamide with 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% polyvinylpyrrolidone, 50 mM sodium 

phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42° C; or (3) employ 50% 
15 formamide, 5 x SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% 

sodium pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 g/ml), 0.1% SDS, and 

10% dexrran sulfate at 42° C, with washes at 42° C in 0.2 x SSC and 0.1% SDS, 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 

linked coding sequence in a particular host organism. The control sequences that are suitable for 
20 prokaryotes, for example, include a promoter, optionally an operator sequence, a ribosome binding site, and 

possibly, other as yet poorly understood sequences. Eukaryotic cells are known to utilize promoters, 

polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another 

nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA 
25 for a polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a 
... promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; 

or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate 

translation. Generally, "operably linked" means that the DNA sequences being linked are contiguous and, in 

the case of a secretory leader, contiguous and in reading phase. However enhancers do not have to be 
30 contiguous. Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, 

then synthetic oligonucleotide adaptors or linkers are used in accord with conventional practice. 

An "exogenous" element is defined herein to mean nucleic acid sequence that is foreign to the cell, 

or homologous to the cell but in a position within the host cell nucleic acid in which the element is ordinarily 

not found. 

35 As used herein, the expressions "cell", "cell line", and "cell culture" are used interchangeably, and 

all such designations include progeny. Thus, the words "transformants" and "transformed cells" include the 
primary subject cell and cultures derived therefrom without regard for the number of transfers. It is also 
understood that all progeny may not be precisely identical in DNA content, due to deliberate or inadvertent 
mutations. Mutant progeny that have the same function or biological activity as screened for in the 

40 originally transformed cell are included. It will be clear from the context where distinct designations are 
intended. 

"Plasmids" are designated by a lower case "p" preceded and/or followed by capital letters and/or 
numbers. Hie starting plasmids herein are commercially available, are publicly available on an unrestricted 
basis, or can be constructed from such available plasmids in accord with published procedures. In addition, 
45 other equivalent plasmids are known in the an and will be apparent to the ordinary artisan. 
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"Restriction enzyme digestion" of DNA refers to catalytic cleavage of the DNA with an enzyme that 
acts only at certain locations in the DNA. Such enzymes are called restriction endonucleases, and the sites 
for which each is specific is called a restriction site. The various restriction enzymes used herein are 
commercially available and their reaction conditions, cofactors. and other requirements as established by the 
5 enzyme suppliers are used. Restriction enzymes commonly are designated by abbreviations composed of a 
capital letter followed by other letters representing the microorganism from which each restriction enzyme 
originally was obtained, and then a number designating the particular enzyme. In general, about 1 mg of 
plasmid or DNA fragment is used with about 1-2 units of enzyme in about 20 ml of buffer solution. 
Appropriate buffers and substrate amounts for particular restriction enzymes are specified by the 

10 manufacturer, incubation of about 1 hour at 37°C is ordinarily used, but may vary in accordance with the 
supplier's instructions. After incubation, protein or polypeptide is removed by extraction with phenol and 
chloroform, and the digested nucleic acid is recovered from the aqueous fraction by precipitation with 
ethanoL Digestion with a restriction enzyme may be followed with bacterial alkaline phosphatase hydrolysis 
of the terminal 5' phosphates to prevent the two restriction cleaved ends of a DNA fragment from 

15 "circularizing" or forming a closed loop that would impede insertion of another DNA fragment at the 
restriction site. Unless otherwise stated, digestion of plasmids is not followed by 5' terminal 
dephosphorylation. Procedures and reagents for dephosphorylation are conventional as described in sections 
1.56-1.61 of Sambrook et ai, (Molecular Cloning: A Laboratory Manual New York: Cold Spring Harbor 
Laboratory Press, 1989). 

20 "Recovery" or "isolation" of a given fragment of DNA from a restriction digest means separation of 

the digest on polyacrylamide or agarose gel by electrophoresis, identification of the fragment of interest by 
comparison of its mobility versus that of marker DNA fragments of known molecular weight, removal of the 
gel section containing the desired fragment, and separation of the gel from DNA. This procedure is known 
generally. For example, see Lawn et al. y Nucleic Acids Res. 9:6103-61 14 (198 1 ), and Goeddel et ai. Nucleic 

25 Acids Res. 8:4057 1980). 

"Northern analysis" is a method used to identify RNA sequences that hybridize to a known probe 
such as an oligonucleotide, DNA fragment, cDNA or fragment thereof, or RNA fragment. The probe is 
labeled with a radioisotope such as 5 ^P. or by biotinylation, or with an enzyme. The RNA to be analyzed is 
usually electrophoretically separated on an agarose or polyacrylamide gel, transferred to nitrocellulose, 

30 nylon, or other suitable membrane, and hybridized with the probe, using standard techniques well known in 
the art such as those described in sections 7.39-7.52 of Sambrook et al., supra. 

"Ligation" refers to the process of forming phosphodiester bonds between two nucleic acid 
fragments. To ligate the DNA fragments together, the ends of the DNA fragments must be compatible with 
each other. In some cases, the ends will be directly compatible after endonuclease digestion. However, it 

35 may be necessary to first convert the staggered ends commonly produced after endonuclease digestion to 
blunt ends to make them compatible for ligation. To blunt the ends, the DNA is treated in a suitable buffer 
for at least 15 minutes at 15°C with about 10 units of the Klenow fragment of DNA polymerase I or T4 DNA 
polymerase in the presence of the four deoxyribonucleotide triphosphates. The DNA is then purified by 
phenol-chloroform extraction and ethanol precipitation. The DNA fragments that are to be ligated together 

40 are put in solution in about equimolar amounts. The solution will also contain ATP, ligase buffer, and a 
ligase such as T4 DNA ligase at about 10 units per 0.5 mg of DNA. If the DNA is to be ligated into a vector, 
the vector is first linearized by digestion with the appropriate restriction endonuclease(s). The linearized 
fragment is then treated with bacterial alkaline phosphatase, or calf intestinal phosphatase to prevent self- 
ligation during the ligation step. 

45 "Preparation" of DNA from cells means isolating the plasmid DNA from a culture of the host cells. 

Commonly used methods for DNA preparation are the large and small-scale plasmid preparations described 
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in sections 1.25-1.33 of Sambrook et ai. supra. After preparation of the DNA. it can be purified by methods 
well known in the art such as that described in section 1 .40 of Sambrook et al. t supra. 

"Oligonucleotides" are short-length, single- or double-stranded polydeoxynucleotides that are 
chemically synthesized by known methods (such as phosphotriester, phosphite, or phosphoramidite 
5 chemistry, using solid phase techniques such as described in EP 266,032. published 4 May 1988, or via 
deoxynucleoside H-phosphonate intermediates as described by Froehier et ai % NucL Acids Res. 14:5399- 
5407, 1986. They are then purified on polyacrylamide gels. 

The technique of "polymerase chain reaction," or "PCR," as used herein generally refers to a 
procedure wherein minute amounts of a specific piece of nucleic acid, RNA and/or DNA, are amplified as 

10 described in U.S. Pat. No. 4,683,195, issued 28 July 1987. Generally, sequence information from the ends of 
the region of interest or beyond needs to be available, such that oligonucleotide primers can be designed; 
these primers will be identical or similar in sequence to opposite strands of the template to be amplified. The 
5' terminal nucleotides of the two primers may coincide with the ends of the amplified material. PCR can be 
used to amplify specific RNA sequences, specific DNA sequences from total genomic DNA, and cDNA 

15 transcribed from total cellular RNA. bacteriophage or plasmid sequences, etc. See generally Mullis et ai, 
Cold Spring Harbor Symp. Quant. Biol. 51: 263 (1987); Erlich, ed., PCR Technology, (Stockton Press. NY, 
1989). As used herein, PCR is considered to be one, but not the only, example of a nucleic acid polymerase 
reaction method for amplifying a nucleic acid test sample, comprising the use of a known nucleic acid (DNA 
or RNA) as a primer, and utilizes a nucleic acid polymerase to amplify or generate a specific piece of nucleic 

20 acid or to amplify or generate a specific piece of nucleic acid which is complementary to a particular nucleic 
acid. 

The "HRG tyrosine autophosphorylation assay' to detect the presence or bioactiviry of HRG ligands 
can be used to monitor the purification of a ligand for the HER2 and HER3 receptors. This assay is based on 
the assumption that a specific ligand for the receptor will stimulate autophosphorylation of the receptor, in 

25 analogy with EOF and its stimulation of EGF receptor autophosphorylation. See Sadich et ah. Anal. 
^ ^BJoahem. 235:207-214 (1996). MDA-MB-453 cells or MCF7 cells which contain high levels of pl85 HER2 
receptors but negligible levels of human EGF receptors, were obtained from the American Type Culture 
Collection, Rockville, Md. (ATCC No HTB-13 1) and maintained in tissue culture with 10% fetal calf serum 
in DMEM/Hams F12 (1:1) media. For assay, the cells were trypsinized and plated at 150,000 cells/well in 

30 24 well dishes (Costar). After incubation with serum containing media overnight, the ceils were placed in 
serum free media for 2-18 hours before assay. Test samples of 100 uL aliquots were added to each well. 
The cells were incubated for 5-30 minutes (typically 30 min) at 37°C and the media removed. The cells in 
each well were treated with 100 uL SDS gel denaturing buffer (SEPROSOL. Enpotech, Inc.) and the plates 
heated at 100°C for 5 minutes to dissolve the cells and denature the proteins. Aliquots from each well were 

35 electrophoresed on 5-20% gradient SDS gels (NOVEX, Encinitas, CA) according to the manufacturer's 
directions. After the dye front reached the bottom of the gel, the electrophoresis was terminated and a sheet 
of PVDF membrane (PROBLOTT, ABI) was placed on the gel and the proteins transferred from the gel to 
the membrane in a bloning chamber (BioRad) at 200 mAmps for 30-60 min. After blotting, the membranes 
were incubated with TR1S buffered saline containing 0.1% TWEEN 20 detergent buffer with 5% BSA for 2- 

40 18 hrs to block nonspecific binding, and then treated with a mouse anti-phosphotyrostne antibody (Upstate 
Biological Inc., N.Y.). Subsequently, the membrane blots were treated with goat anti-mouse antibody 
conjugated to alkaline phosphatase. The gels were developed using the PROTOBLOT System from 
Promega. After drying the membranes, the density of the bands corresponding to pi 85 HER2 in each sample 
lane was quantitated with a Hewlett Packard SCANJET Plus Scanner attached to a Macintosh computer. The 

45 number of receptors per cell in the MDA-MB-453 cells is such that under these experimental conditions the 
p!85 H ^R2 receptor protein is the major protein which is labeled. 
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"Protein microsequehcing" was accomplished based upon the following procedures. Proteins from 
the final H PLC step were either sequenced directly by automated Edman degradation with a model 470A 
Applied Biosystems gas phase sequencer equipped with a 120 A PTH amino acid analyzer or sequenced after 
digestion with various chemicals or enzymes. PTH amino acids were integrated using the 
5 CHROMPERFECT data system (Justice Innovations, Palo Alto, CA). Sequence interpretation was 
performed on a VAX 11/785 Digital Equipment Corporation computer as described (Henzel et a/., J. 
Chromatography 404:41-52 (1987)). In some cases, aiiquots of the HPLC fractions were electrophoresed 
on 5-20% SDS polyacryl amide gels, electrotransferred to a PVDF membrane (PROBLOTT, ABI, Foster 
City, CA) and stained with Coomassie Brilliant Blue (Matsudaira, P., J. Biol. Chem. 262:10035-10038, 

10 1987). The specific protein was excised from the blot for N terminal sequencing. To determine internal 
protein sequences, HPLC fractions were dried under vacuum (SPEEDVAC), resuspended in appropriate 
buffers, and digested with cyanogen bromide, the lysine-specific enzyme Lys-C (Wako Chemicals, 
Richmond, VA) or Asp-N (Boehringer Mannheim, Indianapolis, Ind.). After digestion, the resultant peptides 
were sequenced as a mixture or were resolved by HPLC on a C4 column developed with a propanol gradient 

15 in 0.1%TFA before sequencing as described above. 

"Antibodies" (Abs) and "immunoglobulins" (lgs) are glycoproteins having the same structural 
characteristics. While antibodies exhibit binding specificity to a specific antigen, immunoglobulins include 
both antibodies and other antibody-like molecules which lack antigen specificity. Polypeptides of the latter 
kind are, for example, produced at low levels by the lymph system and at increased levels by myelomas. 

20 Papain digestion of antibodies produces two identical antigen-binding fragments, called "Fab" 

fragments, each with a single antigen-binding site, and a residual "Fc" fragment, whose name reflects its 
ability to crystallize readily. Pepsin treatment yields an F(ab') 2 fragment that has two antigen-combining 

sites and is still capable of cross-linking antigen. 

"Fv M is the minimum antibody fragment which contains a complete antigen-recognition and -binding 

25 site. This region consists of a dimer of one heavy chain and one light chain variable domain in tight, non- 
covatent association. It is in this configuration that the three hypervariable regions of each variable domain 
interact to define an antigen-binding site on the surface of the V H -V L dimer. Collectively, the six 
hypervariable regions confer antigen-binding specificity to the antibody. However, even a single variable 
domain (or half of an Fv comprising only three hypervariable regions specific for an antigen ) has the ability 

30 to recognize and bind antigen, although at a lower affinity than the entire binding site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab* fragments differ from Fab fragments by the addition of a few residues at the 
carboxyl terminus of the heavy chain CHI domain including one or more cysteine(s) from the antibody 
hinge region. Fab'-SH is the designation herein for Fab' in which the cysteine residue(s) of the constant 

35 domains bear a free thiol group. F(ab*) 2 antibody fragments originally were produced as pairs of Fab' 
fragments which have hinge cysteines between them. Other chemical couplings of antibody fragments are 
also known. 

The "light chains" of antibodies (immunoglobulins) from any vertebrate species can be assigned to one 
of two clearly distinct types, called kappa (k) and lambda (X), based on the amino acid sequences of their 
40 constant domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD, IgE, IgG, 
and IgM and several of these may be further divided into subclasses (isotypes), e.g., IgG K lgG2, lgG3, 
IgG4, IgA, and IgA2. The heavy-chain constant domains that correspond to the different classes of 
45 immunoglobulins are called a, 6, £, and u., respectively. The subunit structures and three-dimensional 
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configurations of different classes of immunoglobulins are well known. 

The term "antibody" herein is used in the broadest sense and specifically covers monoclonal antibodies 
(including full length monoclonal antibodies), polyclonal antibodies, multispecific antibodies (e.g., bispecific 
antibodies), and antibody fragments so long as they exhibit the desired biological activity. 

"Antibody fragments" comprise a portion of a full length antibody, generally the antigen binding or 
variable domain thereof Examples of antibody fragments include Fab, Fab\ F(ab')2, and Fv fragments; 
diabodies; linear antibodies; single-chain antibody molecules; and multispecific antibodies formed from 
antibody fragments. 

The term "monoclonal antibody" as used herein refers to an antibody obtained from a population of 
substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical 
except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal 
antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to 
conventional (polyclonal) antibody preparations which typically include different antibodies directed against 
different determinants (epitopes), each monoclonal antibody is directed against a single determinant on the 
antigen. The modifier "monoclonal" indicates the character of the antibody as being obtained from a 
substantially homogeneous population of antibodies, and is not to be construed as requiring production of the 
antibody by any particular method. For example, the monoclonal antibodies to be used in accordance with 
the present invention may be made by the hybridoma method first described by Kohler et a!.. Nature 
256:495 (1975), or may be made by recombinant DNA methods (see. e.g., U.S. Patent No. 4,816,567). The 
"monoclonal antibodies" may also be isolated from phage antibody libraries using the techniques described 
in Ciackson et ai, Nature 352:674-628 (1991) and Marks et ai t J. Moi Biol. 222:581-597 (1991), for 
example. 

The monoclonal antibodies herein specifically include "chimeric" antibodies (immunoglobulins) in 
which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences 
in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while 
the remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies 
derived from another species or belonging to another antibody class or subclass, as well as fragments of such 
antibodies, so long as they exhibit the desired biological activity (U.S. Patent No. 4,816,567; and Morrison et 
ai. Proc. Nad. Acad. Set. USA 81:6851-6855 (1984)). 

"Humanized" forms of non-human {e.g., murine) antibodies are chimeric antibodies which contain 
minimal sequence derived from non-human immunoglobulin. For the most part, humanized antibodies are 
human immunoglobulins (recipient antibody) in which hypervariable region residues of the recipient are 
replaced by hypervariable region residues from a non-human species (donor antibody) such as mouse, rat, 
rabbit or nonhuman primate having the desired specificity, affinity, and capacity. In some instances, 
framework region (FR) residues of the human immunoglobulin are replaced by corresponding non-human 
residues. Furthermore, humanized antibodies may comprise residues which are not found in the recipient 
antibody or in the donor antibody. These modifications are made to further refine antibody performance. In 
general, the humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the hypervariable regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the FRs are those of a human immunoglobulin sequence. The 
humanized antibody optionally also will comprise at least a portion of an immunoglobulin constant region 
(Fc), typically that of a human immunoglobulin. For further details, see Jones et ai. Nature 321:522-525 
(1986); Reichmann et at., Nature 332:323-329 (1988); and Presta, Curr. Op. Struct. Biol. 2:593-596 (1992). 

"Single-chain Fv" or "sFv" antibody fragments comprise the V H and V L domains of antibody, wherein 
these domains are present in a single polypeptide chain. Generally, the Fv polypeptide further comprises a 
polypeptide linker between the V H and V L domains which enables the sFv to form the desired structure for 
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antigen binding. For a review of. sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies, vol. 
1 13, Rosenburg and Moore eds. Springer- Verlag, New York, pp. 269-3 15 (1994). 

The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 
fragments comprise a heavy chain variable domain (V H ) connected to a light chain variable domain (V L ) in 

5 the same polypeptide chain (V H - V L ). By using a linker that is too short to allow pairing between the two 
domains on the same chain, the domains are forced to pair with the complementary domains of another chain 
and create two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 
93/1 1 161; and Hollinger et ai. Proc. Natl. Acad ScL USA 90:6444-6448 ( 1993). 

The expression "linear antibodies" when used throughout this application refers to the antibodies 
10 described in Zapata et ai. Protein Eng. 8(10): 1057- 1 062 (1995). Briefly, these antibodies comprise a pair of 
tandem Fd segments ( v h"^H 1 * v H"^H i ) which form a pair of antigen binding regions. Linear antibodies 
can be bispecific or monospecific. 

II. USE AND PREPARATION OF HRG SEQUENCES 

H. PREPARATION OF HRG SEQUENCES, INCLUDING VARIANTS 

15 The system to be employed in preparing HRG sequence will depend upon the particular HRG 

sequence selected. If the sequence is sufficiently small HRG may be prepared by in vitro polypeptide 
synthetic methods. Most commonly, however, HRG will be prepared in recombinant cell culture using the 
host-vector systems described below. Suitable HRG includes any biologically active and antigenetically 
active HRG. 

20 In general, mammalian host cells will be employed, and such hosts may or may not contain post- 

translational systems for processing HRG preprosequences in the normal fashion. If the host cells contain 
such systems then it will be possible to recover natural subdomain fragments such as HRG-GFD from the 
cultures. If not, then the proper processing can be accomplished by transforming the hosts with the required 
enzyme(s) or by supplying them in an in vitro method. However, it is not necessary to transform cells with 

25 the complete prepro or structural genes for a selected HRG when it is desired to only produce fragments of 
HRG sequences such as an HRG-GFD. For example, a start codon is ligated to the 5' end of DNA encoding 
an HRG-GFD. this DNA is used io transform host cells and the product expressed directly as the Met N- 
tcrminal form (if desired, the extraneous Met may be removed in vitro or by endogenous N-terminal 
demethionylases). Alternatively, HRG-GFD is expressed as a fusion with a signal sequence recognized by 

30 the host cell, which will process and secrete the mature HRG-GFD as is further described below. Amino 
acid sequence variants of native HRG-GFD sequences are produced in the same way. 

HRG sequences located between the first N-terminal mature residue and the first N-terminal residue 
of HRG-GFD sequence, termed HRG-NTD, may function at least in part as an unconventional signal 
sequence or as a normally circulating carrier/precursor for HRG-GFD having unique biological activity. 

35 HRG-NTD is produced in the same fashion as the full length molecule but from expression of DNA in which 
a stop codon is located at the C-terminus of HRG-NTD. In addition, HRG variants are expressed from DNA 
encoding protein in which both the GFD and NTD domains are in their proper orientation but which contain 
an amino acid insertion, deletion or substitution at the GFD-NTD cleavage site (located within the sequence 
VKC) which inhibits or prevents proteolytic cleavage of the NTD-GFD joining site in vivo, and wherein a 

40 stop codon is positioned at the 3* end of the GFD- encoding sequence. In an example of this group of 
variants (termed HRG-NTDXGFD), (1) the lysine residue found in the NTD-GFD joining sequence VKC is 
deleted or (preferably) substituted by another residue other than arginyl such as histidyl, aianyl, threonyl or *v x w 
seryl and (2) a stop codon is introduced in the sequence RCT or RCQ in place of cysteinyl, or threonyl (for 
HRG-a) or glutaminyl (for HRG-p). 
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A preferred HRG-aligand with binding affinity 10 pl85 HtK - comprises amino acids 226-265 of 
figure 1A-D. This HRG-a iigand further may comprise up to an additional 1-20 amino acids preceding 
amino acid 226 and 1-20 amino acids following amino acid 265. A preferred HRG-p Iigand with binding 
affinity to p 1 85*"*ER2 comprises amino acids 226-265 of figure 2A-E. This HRG-P Iigand may comprise up 
5 to an additional 1 -20 amino acids preceding amino acid 226 and 1-20 amino acids following amino acid 265. 

As noted above, other HRG sequences to be prepared in accordance with this invention are those of 
the GFD. These are synthesized in vitro or are produced in recombinant cell culture. These are produced 
most inexpensively in yeast or E.coii by secretion under the control of a HRG-heterologous signal as 
described infra, although preparation in mammalian cells is also contemplated using a mammalian protein 
10 signal such as that of tPA, UK or a secreted viral protein. The GFD can be the sequence of a native HRG or 
may be a variant thereof as described below. GFD sequences include those in which one or more residues 
from a member of the EGF family are substituted into or onto the GFD sequence. 

An additional HRG is one which contains the GFD and the sequence between the C-terminus of 
GFD and the N-terminus of the transmembrane domain (the later being termed the C-terminal cleavage 
15 domain or CTC). in this variant (HRG-GFD-CTC) the DNA start codon is present at the 5' end of HRG- 
heterologous signal sequence o;- adjacent the 5' end of the GFD-encoding region, and a stop codon is found 
in place of one of the first about 1 to 3 extra-cellular domain (ECD) residues or first about 1-2 
transmembrane region residues. In addition, in some HRG-GFD-CTC variants the codons are modified in 
the GFD-CTC proteolysis site by substitution, insertion or deletion. The GFD-CTC proteolysis site is the 
20 domain that contains the GFD C-terminal residue and about 5 residues N- and 5 residues C-terminal from 
this residue. It is known that Met-227 terminal and VaI-229 terminal HRG-a-GFD are biologically active. 
The C-terminus for HRG-a-GFD may be Met-227, Lys-228, Val-229, Gln-230, Asn-23 1 orGln-232, and for 
HRG-P-GFD may be Met-226, AIa-227, Ser-228, Phe-229, Trp-230, or Lys23 1/Ser23 1 . The native C- 
terminus is determined readily by C- terminal sequencing, although it is not critical that HRG-GFD have the 
25*:-.' v. Native terminus so long as the GFD sequence possesses the desired activity. In some embodiments of HRG- 
GFD-CTC variants, the amino acid change(s) in the CTC are screened for their ability to resist proteolysis in 
vitro and inhibit the protease responsible for generation of HRG-GFD. 

HRG-ECD variants are made by providing a stop codon at the same location as for HRG-GFD-CTC 
variants. HRG-ECD may comprise any one or more of the variants described above in connection with its 
30 subfragments. e.g. the GFD-CTC variants containing CTC-proteolysis site modifications. 

If it is desired to prepare the longer HRG polypeptides and the 5' or 3' ends of the given HRG are 
not described herein, it may be necessary to prepare nucleic acids in which the missing domains are supplied 
by homologous regions from more complete HRG nucleic acids. Alternatively, the missing domains can be 
obtained by probing libraries using the DNAs disclosed in the Figures or fragments thereof. 
35 A. Isolation of DNA Encoding Heregulin 

The DNA encoding HRG may be obtained from any cDNA library prepared from tissue believed to 

possess HRG mRNA and to express it at a detectable level. HRG-a gene thus may be obtained from a 
genomic library. Similar procedures may be used for the isolation of other HRG, such as HRG-p I , HRG-p2, 
or HRG-P3 encoding genes. 

40 Libraries are screened with probes designed to identify the gene of interest or the protein encoded 

by it. For cDNA expression libraries, suitable probes include monoclonal or polyclonal antibodies that 
recognize and specifically bind to HRG-a; oligonucleotides of about 20-80 bases in length that encode 
known or suspected portions of HRG-a cDNA from the same or different species: and/or complementary or 
homologous cDNAs or fragments thereof that encode the same or a similar gene. Appropriate probes for 
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screening genomic DNA libraries include, but are not limited to. oligonucleotides; cDNAs or fragments 
thereof that encode the same or a similar gene: and/or homologous genomic DNAs or fragments thereof. 
Screening the cDNA or genomic library with the selected probe may be conducted using standard procedures 
as described in chapters 10-12 of Sambrook et ai, supra. 

An alternative means to isolate the gene encoding HRG-a is to use polymerase chain reaction (PCR) 
methodology as described in section 14 of Sambrook et ai t supra. This method requires the use of 
oligonucleotide probes that will hybridize to HRG-a. Strategies for selection of oligonucleotides are 
described below. 

Another alternative method for obtaining the gene of interest is to chemically synthesize it using one 
of the methods described in Engels et ai. (Agnew. Chem. int. Ed. Engl., 28: 716-734,1989. These methods 
include triester, phosphite, phosphoramidite and H-Phosphonate methods, PCR and other autoprimer 
methods, and oligonucleotide syntheses on solid supports. These methods may be used if the entire nucleic 
acid sequence of the gene is known, or the sequence of the nucleic acid complementary to the coding strand 
is available, or alternatively, if the target amino acid sequence is known, one may infer potential nucleic acid 
sequences using known and preferred coding residues for each amino acid residue. 

A preferred method of practicing this invention is to use carefully selected oligonucleotide 
sequences to screen cDNA libraries from various tissues, preferably human breast, colon, salivary gland, 
placental, fetal, brain, and carcinoma cell lines. Other biological sources of DNA encoding an heregulin-like 
huand include other mammals and birds. Among the preferred mammals are members of the following 
orders; bovine, ovine, equine, murine, and rodentia. 

The oligonucleotide sequences selected as probes should be of sufficient length and sufficiently 
unambiguous that false positives are minimized. The actual nucleotide sequence(s) may, for example, be 
based on conserved or highly homologous nucleotide sequences or regions of HRG-a. The oligonucleotides 
ma\ be degenerate at one or more positions. The use of degenerate oligonucleotides may be of particular 
importance where a library is screened from a species in which preferential codon usage in that species is not 
known. The oligonucleotide must be labeled such that it can be detected upon hybridization to DNA in the 
library being screened. The preferred method of labeling is to use 3 -P-labeled ATP with polynucleotide 
kinase, as is well known in the art, to radiolabel the oligonucleotide. However, other methods may be used 
to label the oligonucleotide, including, but not limited to. biotinylation or enzyme labeling. 

Of particular interest is HRG-a nucleic acid that encodes a full-length polypeptide. In some 
preferred embodiments, the nucleic acid sequence includes the native HRG-a signal sequence. Nucleic acid 
having all the protein coding sequence is obtained by screening selected cDNA or genomic libraries, and. if 
necessary, using conventional primer extension procedures as described in section 7.79 of Sambrook et ai., 
supra, to detect precursors and processing intermediates of mRNA that may not have been reverse- 
transcribed into cDNA. 

HRG-a encoding DNA of Figures 1A-1D may be used to isolate DNA encoding the analogous 
ligand from other animal species via hybridization employing the methods discussed above. The preferred 
animals are mammals, particularly bovine, ovine, equine, feline, canine and rodentia, and more specifically 
rats, mice and rabbits. 

B. Amino Acid Sequence Variants of Heregulin 

Amino acid sequence variants of HRG are prepared by introducing appropriate nucleotide changes 
into HRG DNA, or by in vitro synthesis of the desired HRG polypeptide. Such variants include, for 
example, deletions from, or insertions or substitutions of, residues within the amino acid sequence shown for 
human HRG sequences. Any combination of deletion, insertion, and substitution can be made to arrive at the 
final construct, provided that the final construct possesses the desired characteristics. Excluded from the 
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scope of this invention are HRG variants or polypeptide sequences that are not novel and unobvious over the 
prior art. The amino acid changes also may alter post-translational processes of HRG-a, such as changing 
the number or position of glycosylation sites, altering the membrane anchoring characteristics, altering the 
intra-celluiar location of HRG by inserting, deleting, or otherwise affecting the leader sequence of the native 
5 HRG. or modifying its susceptibility to proteolytic cleavage. 

The HRG sequence may be proteolytically processed to create a number of HRG fragments. HRG- 
GFD sequences of HRG-a all contain the amino acid sequence between HRG-a cysteine 226 and cysteine 
265. The amino terminus of HRG-a fragment may result from the cleavage of any peptide bond between 
alanine I and cysteine 226, preferably adjacent to an arginine, lysine, valine, or methionine, and most 

10 preferably between methionine 45 and serine 46, The carboxy terminus of HRG-a fragment may result from 
the cleavage of any peptide bond between cysteine 265, preferably adjacent to an arginine, lysine, valine, or 
methionine, and most preferably between lysine 272 and valine 273, between lysine 278 and alanine 279, or 
between lysine 285 and arginine 286. The resulting HRG-a ligands resulting from such proteolytic 
processing are the preferred ligands. 

15 HRG-P-GFD's are analogous to those discussed above for HRG-a-GFD's. Each HRG-p-GFD 

contains me polypeptide segment from cysteine 212 to cysteine 251 of figure 2A-E. The amino terminus of 
HRG-pl fragment may result from the cleavage of any peptide bond between alanine I and cysteine 212, 
preferably adjacent to an arginine, lysine, valine, or methionine, and most preferably between methionine 31 
and serine 32. The carboxy terminus of HRG-P! fragment may result from the cleavage of any peptide bond 

20 between cysteine 251 of Fig. 2A-2E, preferably adjacent to an arginine, lysine, valine, ur methionine, and 
most preferably between valine 255 and methionine 256, between lysine 261 and histidine 262, between 
lysine 276 and alanine 277, or between lysine 301 and thrionine 302. The resulting HRG-P 1 ligands 
resulting from such proteolytic processing are among the preferred ligands. Similarly, processing to produce 

cr , preferred fragment ligands of HRG-P2 based upon the Fig. 3A-3E and HRG-P3 based upon Fig. 4A-4C may 

25 be accomplished by cleaving HRG sequences of Figs. 3A-3E and 4A-4C preferably adjacent to an arginine, 
lysine, valine or methionine. 

In designing amino acid sequence variants of HRG, the location of the mutation site and the nature 
of the mutation will depend on HRG characteristic(s) to be modified. The sites for mutation can be modified 
Individually or in series, e.g., by (1) substituting first with conservative amino acid choices and then with 

30 more radical selections depending upon the results achieved, (2) deleting the target residue, or (3) inserting 
residues of other receptor ligands adjacent to the located site. 

A useful method for identification of certain residues or regions of HRG polypeptide that are 
preferred locations for mutagenesis is called "alanine scanning mutagenesis" as described by Cunningham 
and Wells (Science. 244: 1081-1085, 1989). Here, a residue or group of target residues are identified (e.g., 

35 charged residues such as arg, asp, his, lys, and glu) and replaced by a neutral or negatively charged amino 
acid (most preferably alanine or polyalanine) to affect the interaction of the amino acids with the surrounding 
aqueous environment in or outside the cell. Those domains demonstrating functional sensitivity to the 
substitutions then are refined by introducing further or other variants at or for the sites of substitution. Thus, 
while the site for introducing an amino acid sequence variation is predetermined, the nature of the mutation 

40 per se need not be predetermined. For example, to optimize the performance of a mutation at a given site, 
ala scanning or random mutagenesis may be conducted at the target codon or region and the expressed HRG 
variants are screened for the optimal combination of desired activity. 

There are two principal variables in the construction of amino acid sequence variants: the location 
of the mutation site and the nature of the mutation. These are variants from HRG sequence, and may 



18 



WO 99/39729 



PCT/US99/02390 



represent naturally occurring alleles (which will not require manipulation of HRG DNA) or predetermined 
mutant forms made by mutating the DNA, either to arrive at an allele or a variant not found in nature. In 
general the location and nature of the mutation chosen will depend upon HRG characteristic to be modified. 
Obviously, such variations that, for example, convert HRG into a known receptor ligand, are not included 
within the scope of this invention, nor are any other HRG variants or polypeptide sequences that are not 
novel and unobvious over the prior art. 

Amino acid sequence deletions generally range from about I to 30 residues, more preferably about 
1 to 10 residues, and typically about 1 to 5 are contiguous. Deletions may be introduced into regions of low 
homology with other EGF family precursors to modify the activity of HRG. Deletions from HRG in areas of 
substantial homology with other EGF family sequences will be more likely to modify the biological activity 
of HRG more significantly. The number of consecutive deletions will be selected so as tc preserve the 
tertiary structure of HRG in the affected domain, e.g., cysteine crossiinking, beta-pleated sheet or alpha 
helix. 

Amino acid sequence insertions include amino- and/or carboxyl-terminal fusions ranging in length 
from one residue to polypeptides containing a hundred or more residues, as well as intrasequence insertions 
of single or multiple amino acid residues. Intrasequence insertions (i.e., insertions within HRG sequence) 
may range generally from about I to 10 residues, more preferably 1 to 5, and most preferably I to 3. 
Examples of terminal insertions include HRG with an N-terminal methionyl residue (an artifact of the direct 
expression of HRG in bacterial recombinant cell culture), and fusion of a heterologous N-terminal signal 
sequence to the N-terminus of HRG to facilitate the secretion of mature HRG from recombinant host cells. 
Such signal sequences generally will be obtained from, and thus be homologous to, the intended host cell 
species. Suitable sequences include STIl, tPA or Ipp for £. co/j, alpha factor for yeas:, and viral signals such 
as herpes gD for mammalian cells. 

Other insertional variants of HRG include the fusion to the N- or C-terminus of HRG of an 
immunogenic polypeptide, e.g., bacterial polypeptides such as beta-lactamase or an enzyme encoded by the 
£. coli trp locus, or yeast protein, bovine serum albumin, and chemotactic polypeptides. C-terminal fusions 
of HRG-ECD with proteins having a long half-life such as immunoglobulin constant regions (or other 
immunoglobulin regions), albumin, or ferritin, as described in WO 89/02922, published 6 April 1989 are 
contemplated. 

Another group of variants are amino acid substitution variants. These variants have at least one 
amino acid residue in HRG molecule removed and a different residue inserted in its place. The sites of 
greatest interest for substitutional mutagenesis include sites identified as the active site(s) of HRG, and sites 
where the amino acids found in HRG ligands from various species are substantially different in terms of side- 
chain bulk, charge, and/or hydrophobicity. A likely sub-domain of HRG-GFD having biological activity as a 
growth factor is the C-terminal segment, in particular within the sequence about from glycine 218 to valine 
226 (HRG-a), and glycine 218 to lysine 228/serine 228 (HRG-p) based upon analogy to the EGF sub- 
sequence found to have EGF activity. 

Other sites of interest are those in which particular residues of HRG-like ligands obtained from 
various species are identical. These positions may be important for the biological activity of HRG. These 
sites, especially those falling within a sequence of at least three other identically conserved sites, are 
substituted in a relatively conservative manner. Such conservative substitutions are shown in Table 1 under 
the heading of "preferred substitutions". !f such substitutions result in a change in biological activity, then 
more substantial changes, denominated exemplary substitutions in Table 1, or as further described below in 
reference to amino acid classes, are introduced and the products screened. 
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TABLE I 



Original 
Residue 


Exemplary 

^nh*;f imt innc 


Preferred 

QiiKcf ifntinnc 

OULJjlIlLillUll.) 


Aia (A) 


vai; leu; ile 


val 


Arg(R) 


lys; gin; asn 


lys 


Asn (N) 


gin; his; lys; arg 


gin 


A Qn (Wl 
/\Sp {U) 


glu 


glu 


Cys (C) 


ser 


ser 


Gin (Q) 


asn 


asn 


Glu(E) 


asp 


asp 


Gly (G) 


pro 


pro 


His (H) 


asn; gin; lys; arg 


arg 


He (I) 


leu; val; met; ala; phe; 
norleucine 


leu 


I en (\ \ 


nuncucint., lie, vdi, 
met; ala; phe 


ile 


Lys(K) 


arg; gin; asn 


arg 


Met(M) 


leu; phe; ile 


leu 


Phe (F) 


leu; vai; ile; ala 


leu 


i'ro (?) 


giy 


giy 


Ser (S) 


thr 


thr 


Ihr (T) 


ser 


ser 


lrp<\V> 


ryr 


ryr 


1 yrfY> 


trp; phe; thr; ser 


phe 


VahV) 


ile; leu; met; phe; 
ala; norleucine 


leu 



Substantial modifications in function or immunological identity of HRG are accomplished by 
selecting substitutions that differ significantly in their effect on maintaining (a) the structure of the 
polypeptide backbone in the area of the substitution, for example, as a sheet or helicd conformation, (b) the 
charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain. Naturally 
occurring residues are divided into groups based on common side chain properties: 

1 ) hydrophobic: norleucine. met, ala, val. leu, ile; 

2) neutral hydrophilic: cys. ser, thr; 

3) acidic: asp, glu; 

4) basic: asn, glri, his, lys, arg; 

5) residues that influence chain orientation: gly, pro; and 

6) aromatic: trp, ryr, phe. 

Non-conservative substitutions will entail exchanging a member of one of these classes for another. 
Such substituted residues may be introduced into regions of HRG that are homologous with other receptor 
ligands, or, more preferably, into the non-homologous regions of the molecule. 

In one embodiment of the invention, it is desirable to inactivate one or more protease cleavage sites 
that are present in the molecule. These sites are identified by inspection of the encoded amino acid sequence. 
Where protease cleavage sites are identified, they are rendered inactive to proteolytic cleavage by 
substituting the targeted residue with another residue, preferably a basic residue such as glutamine or a 
hydrophylic residue such as serine; by deleting the residue; or by inserting a prolyl residue immediately after 
the residue. 
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In another embodiment, any methionyi residue other than the starting methionyl residue of the 
signal sequence, or any residue located within about three residues N- or C-terminal to each such methionyl 
residue, is substituted by another residue (preferably in accord with Table 1 ) or deleted. Alternatively, about 
1-3 residues are inserted adjacent to such sites. 
5 Any cysteine residues not involved in maintaining the proper conformation of HRG also may be 

substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant 
crossl inking. 

Sites particularly suited for substitutions, deletions or insertions, or use as fragments, include, 
numbered from the N-terminus of HRG-oc of Figure 1 A - ID: 
10 1) potential glycosaminoglycan addition sites at the serine-glycine dipeptides at 42-43, 64-65, 151- 
152; 

2) potential asparagine-linked glycosylation at positions 164, 170, 208 and 437, sites (NDS) 164-166, 
(NIT) 170-172, (NTS) 208-210, and NTS (609-61 1); 

3) potential O-glycosylation in a cluster of serine and threonine at 209-2 1 8; 
1 5 4) cysteines at 226, 234, 240, 254, 256 and 265; 

5) transmembrane domain at 287-309; 

6) loop 1 delineated by cysteines 226 and 240; 

7) loop 2 delineated by cysteines 234 and 254; 

8) loop 3 delineated by cysteines 256 and 265; and 

20 9) potential protease processing sites at 2-3, 8-9. 23-24, 33-34, 36-37, 45-46. 48-49, 62-63, 66-67, 86- 
87, 1 10-1 1 1, 123-124, 134-135. 142-143, 272-273, 278-279 and 285-286; 

Analogous regions in HRG-Pl may be determined by reference to its' sequence. The analogous 
HRG-pl amino acids may be mutated or modified as discussed above for HRG-a. Analogous regions in 
HRG-P2 may also be determined by reference to its' sequence. The analogous HRG-P2 amino acids may be 
25 mutated or modified as discussed above for HRG-a or HRG-pl. Analogous regions in HRG-P3 may be 
determined by reference to its' sequence. Further, the analogous HRG-p3 amino acids may be mutated or 
modified as discussed above for HRG-a. HRG-Pl, or HRG-P2. 

Another HRG variant is y-HRG (or gamma-heregulin). y-HRG is any polypeptide sequence that 

possesses at least one biological property of native sequence y-HRG having SEQ ID NO:l 1 . The biological 
30 property of this variant is the same as for HRG noted above. This variant encompasses not only the 
polypeptide isolated from a native y-HRG source such as human MDA-MB-1 75 cells or from another 
source, such as another animal species, but also the polypeptide prepared by recombinant or synthetic 
methods. It also includes variant forms including functional derivatives, allelic variants, naturally occurring 
isoforms and analogues thereof. Sometimes the y-HRG is "native y-HRG" which refers to endogenous y- 
35 HRG polypeptide which has been isolated from a mammal. The y-HRG can also be "native sequence y- 
HRG" insofar as it has the same amino acid sequence as a native y-HRG (e.g. human y-HRG shown in Fig. 
7A-7C). Amino acid sequence variants of the native sequence are prepared by introducing appropriate 
nucleotide changes into the native sequence DNA, or by in vitro synthesis of the desired polypeptide. Such 
variants include, for example, deletions from, or insertions or substitutions of. residues within the amino acid 
40 sequence shown for the human protein in Fig. 7A-7C as generally described above for other HRG. Any : 
combination of deletion, insertion, and substitution is made to arrive at the final construct, provided that the 
final construct possesses the desired characteristics. The amino acid changes also may alter post- 
translational processes of the native sequence, such as changing the number or position of O-linked 
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glycosylation sites. 

Another variant is the polypeptide referred to as sensory' and motor neuron derived factor (SMDF) 
whose nucleic acid and amino acid sequences (SEQ ID NOS:13 and 14) are shown in Fig. 8 which can be 
prepared as described in WO 96/15244. The SMDF polypeptides of the invention exhibit the properties of 
5 binding to the HER2/HER3 receptors and stimulating epithelial cell growth and differentiation in a manner 
similar to HRG polypeptides discussed above. Amino acid sequence variants of native sequence SMDF are 
prepared by introducing appropriate nucleotide changes into the native sequence SMDF DNA, or by in vitro 
synthesis of the desired SMDF polypeptide as observed generally above for other HRG. Such variants 
include, for example, deletions from, or insertions or substitutions of, residues within the amino acid 
10 sequence shown for human SMDF in Fig. 8. Any combination of deletion, insertion, and substitution is 
made to arrive at the final construct, provided that the final construct possesses the desired characteristics. 
The amino acid changes also may alter post-translational processes of the native sequence SMDF. such as 
changing the number or position of O-linked glycosylation sites. 

Additional variants include polypeptides in which the variant has an amino acid 

15 substitution at a selected residue corresponding to a residue of 645-amino acid native human heregulin-pl 
selected from: 

SI 77, H178, L179, V180, KI8I, E184, E186, 
K187, T188, V19L N192. G193, G194, EI95, 
M198, VI 99, K200, D201. N204, P205, S206, 
20 R207. Y208, L209, K2!l. P213, N214, E215, 

T217, G218, D219, Q222. N223, Y224, S228, and F229. 
In a variation of this embodiment, the amino acid substitution is not a replacement of the selected residue 
with an epidermal growth factor (EGF) residue corresponding to the selected residue. 
Other heregulin-pl variants include an amino acid substitution selected from: 
25 S177W; H178S, E, R. or A; V180Q, I or E; 

KI81P or A; A183G; E184V, W, K, R, G, orN; 
K185E, S, Q ? or G; E186R; K187E or A; T188Q; 
E195Q; F197Y; MI98R or K; K200R; D201Toi i; 
P205T or Y; S206K. H. G, P. or R; R207Y: 
30 Y208R or L; L209M or G; K211 R; P213S, T, N, or K; 

N214L, K, S, or E; F216M; N223H or W; and M226I. 
In a variation of this embodiment, the heregulin variant includes sets of amino acid substitutions selected 
from this group. Some heregulin variants of the invention having sets of amino acid substitutions exhibit at 
least a 50-fold increase in HER3 receptor affinity, which is also accompanied by an increase in HER4 
35 receptor affinity . Specific variants include: 

AI83G, E184W, K185D, E186R, K187E, TI88G, M2261; 
A183D, E184K, K185S, E186R, K187E, T188G, M226I; 
F197Y, M198K, K200R, D201I, M226I; 
P205Y. S206G, R207Y, Y208L, L209M: 
40 P205Y, S206R, R207Y, Y208R, L209M, M226I; 
P205T, S206H, R207Y, Y208R, L209M; 
P205T. S206K. R207Y, Y208R, L209G; 
N223W. M226I; 
N223H. M226I; 

45 SI77W. H178E, K18IP, AI83G, E184W. K185D, E186R. K187E, T188G, M226I; 
P205Y. S206G, R207Y, Y208L, L209M. M226I; 
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A1830. K185E, EI86R. KI87E, T188G, FI97Y, MI98R. D201T: 

AI83G, KI85E. E186R, K187E, T188G, P205Y. S206G. R207Y, Y208L. L209M; 

A183G, K185E, E186R, K187E, T188G, F197Y, M I98R. D201T. P205Y, S206G, R207Y, Y208L, L209M; 
AI83G, KI85E. EI86R, K187E, T188G. M226I; 
5 FI97Y, M198R, D201T. P205Y. S206G, R207Y, Y208L, L209M; 

F197Y, MI98R. D201T, P205Y, S206G, R207Y, Y208L. L209M. M226I: 
F197Y. MI98R, D201T, M226I; 

AI83G, K185E, EI86R, K187E, T188G, F197Y, M198R, D201T, M2261; 
AI83G, K185E, E186R, K187E, T188G, P205Y, S206G, R207Y. Y208L. L209M, M2261; 
10 A183G, K185E, E186R, K187E, TI88G, F197Y, M198R, D201T. P205Y. S206G. R207Y, Y208L, L209M, 
M226I; 

F197Y. M198R, D201T, P205Y, S206G, R207Y, Y208L, L209M N223H, M226I; and 

A183G. KI85E, E186R. KI87E, T188G, F197Y, M198R, D201T, P205Y, S206G, R207Y, Y208L, L209M, 

N223H. M226I. 

15 In addition to including one or more of the amino acid substitutions disclosed herein, the heregulin 

variant can have one or more other modifications, such as an amino acid substitution, an insertion of at least 
one amino acid, a deletion of at least one amino acid, or a chemical modification. For example, the invention 
provides a heregulin variant that is a fragment. In a variation of this embodiment, the fragment includes 
residues corresponding to a portion of human heregulin-pl extending from about residue 175 to about 

20 residue 230 (i.e., the EGF-like domain). For example, the fragment can extend from residue 177 to residue 
244 and may be prepared by recombinant techniques (rHRGP 1-1 77-244). 

DNA encoding amino acid sequence variants of HRG is prepared by a variety of methods known in 
the art. These methods include, but are not limited to, isolation from a natural source (in the case of naturally 
occurring amino acid sequence variants) or preparation by oligonucleotide-mediated (or site-directed) 

25 mutagenesis. PCR mutagenesis, and cassette mutagenesis of an earlier prepared variant or a non-variant 
version of HRG. These techniques may utilize HRG nucleic acid (DNA or RNA). or nucleic acid 
complementary to HRG nucleic acid. 

Oligonucleotide-mediated mutagenesis is a preferred method for preparing substitution, deletion, 
and insertion variants of HRG DNA. This technique is well known in the art as described by Adeiman et a/.. 

30 DNA, 2: 183 (1983). Briefly, HRG DNA is altered by hybridizing an oligonucleotide encoding the desired 
mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage 
containing the unaltered or native DNA sequence of HRG. After hybridization, a DNA polymerase is used 
to synthesize an entire second complementary strand of the template that will thus incorporate the 
oligonucleotide primer, and will code for the selected alteration in HRG DNA. 

35 Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal 

oligonucleotide will have 12 to 15 nucleotides that are completely complementary' to the template on either 
side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize 
properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques known in the art such as that described by Crea et at. (Proc. Natl. Acad. Set. USA. 75: 

40 5765,1978). 

Single-stranded DNA template may also be generated by denaturing double-stranded plasmid (or 
other) DNA using standard techniques. 

For alteration of the native DNA sequence (to generate amino acid sequence variants, for example), 
the oligonucleotide is hybridized to the single-stranded template under suitable hybridization conditions. A 
45 DNA polymerizing enzyme, usually the Klenow fragment of DNA polymerase I. is then added to synthesize 
the complementary strand of the template using the oligonucleotide as a primer for synthesis. A 
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heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated form of HRG. and 
the other strand (the original template) encodes the native, unaltered sequence of HRG. This heteroduplex 
molecule is then transformed into a suitable host ceil, usually a prokaryote such as £. colt JMI01. After the 
cells are grown, they are plated onto agarose plates and screened using the oligonucleotide primer 

5 radiolabeled with ^-P-phosphate to identify the bacterial colonies that contain the mutated DNA. The 
mutated region is then removed and placed in an appropriate vector for protein production, generally an 
expression vector of the type typically employed for transformation of an appropriate host. 

The method described immediately above may be modified such that a homoduplex molecule is 
created wherein both strands of the plasmid contain the mutation(s). The modifications are as follows: the 
10 single-stranded oligonucleotide is annealed to the single-stranded template as described above. A mixture of 
three deoxyribonucleotides. deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), and 
deoxyribothymidine (dTTP), is combined with a modified thio-deoxyribocytosine called dCTP-(aS) (which 
can be obtained from Amersham Corporation). This mixture is added to the template-oligonucleotide 
complex. Upon addition of DNA polymerase to this mixture, a strand of DNA identical to the template 
15 except for the mutated bases is generated. In addition, this new strand of DNA will contain dCTP-(aS) 
instead of dCTP, which serves to protect it from restriction endonuclease digestion. After the template strand 
of the double-stranded heteroduplex is nicked with an appropriate restriction enzyme, the template strand can 
be digested with ExoUl nuclease or another appropriate nuclease past the region that contains the site(s) to be 
mutagenized. The reaction is then stopped to leave a molecule that is only partially single-stranded. A 
20 complete double-stranded DNA homoduplex is then formed using DNA polymerase in the presence of all 
four deoxyribonucleotide triphosphates, ATP. and DNA ligase. This homoduplex molecule can then be 
transformed into a suitable host cell such as £. coli JMIOI , as described above. 

DNA encoding HRG mutants with more than one amino acid to be substituted may be generated in 
one of several ways. If the amino acids are located close together in the polypeptide chain, they may be 
25 mutated simultaneously using one oligonucleotide that codes for all of the desired amino acid substitutions. 
If, however, the amino acids are located some distance from each other (separated by more than about ten 
amino acids), it is more difficult to generate a single oligonucleotide that encodes all of the desired changes. 
Instead, one of two alternative methods may be employed. 

In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. 
30 The oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second 
strand of DNA that is synthesized from the template will encode all of the desired amino acid substitutions. 

The alternative method involves rwo or more rounds of mutagenesis to produce the desired mutant. 
The first round is as described for the single mutants: wild-type DNA is used for the template, an 
oligonucleotide encoding the first desired amino acid substitution(s) is annealed to this template, and the 
35 heteroduplex DNA molecule is then generated. The second round of mutagenesis utilizes the mutated DNA 
produced in the first round of mutagenesis as the template. Thus, this template already contains one or more 
mutations. The oligonucleotide encoding the additional desired amino acid substitution(s) is then annealed 
to this template, and the resulting strand of DNA now encodes mutations from both the first and second 
rounds of mutagenesis. This resultant DNA can be used as a template in a third round of mutagenesis, and so 
40 on. 

PCR mutagenesis is also suitable for making amino acid variants of HRG. While the following 
discussion refers to DNA, it is understood that the technique also finds application with RNA. The PCR 
technique generally refers to the following procedure (see Erlich, supra, the chapter by R. Higuchi, p. 61- 
70). When small amounts of template DNA are used as starting material in a PCR, primers that differ 
45 slightly in sequence from the corresponding region in a template DNA can be used to generate relatively 
large quantities of a specific DNA fragment that differs from the template sequence only at the positions 
where the primers differ from the template. For introduction of a mutation into a plasmid DNA, one of the 
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primers is designed to overlap the position of the mutation and to contain the mutation: the sequence of the 
other primer must be identical to a stretch of sequence of the opposite strand of the plasmid, but this 
sequence can be located anywhere along the plasmid DNA. It is preferred, however, that the sequence of the 
second primer is located within 200 nucleotides from that of the first, such that in the end the entire amplified 
5 region of DNA bounded by the primers can be easily sequenced. PCR amplification using a primer pair like 
the one just described results in a population of DNA fragments that differ at the position of the mutation 
specified by the primer, and possibly at other positions, as template copying is somewhat error-prone. 

If the ratio of template to product material is extremely low, the vast majority of product DNA 
fragments incorporate the desired mutation(s). This product material is used to replace the corresponding 

10 region in the plasmid that served as PCR template using standard DNA technology. Mutations at separate 
positions can be introduced simultaneously by either using a mutant second primer, or performing a second 
PCR with different mutant primers and ligating the two resulting PCR fragments simultaneously to the vector 
fragment in a three (or more)-part ligation. 

In a specific example of PCR mutagenesis, template plasmid DNA (lmg) is linearized by digestion 

1 5 with a restriction endonuclease that has a unique recognition site in the plasmid DNA outside of the region to 
be amplified. Of this material, 100 ng is added to a PCR mixture containing PCR buffer, which contains the 
four deoxy nucleotide tri-phosphates and is included in the GENEAMP kits (obtained from Perkin-Elmer 
Cetus, Norwalk, CT and Emeryville, CA), and 25 pmole of each oligonucleotide primer, to a final volume of 
50 ml. The reaction mixture is overlaid with 35 ml mineral oil. The reaction is denatured for 5 minutes at 

20 I00°C, placed briefly on ice, and then I ml Thermits aquations (Taq) DNA polymerase (5 units/mi. 
purchased from Perkin-Elmer Cetus, Norwalk, CT and Emeryville, CA) is added below the mineral oil layer. 
The reaction mixture is then inserted into a DNA Thermal Cycler (purchased from Perkin-Elmer Cetus) 
programmed as follows: 

2min. 55°C, 

25 30 sec. 72°C, then 19 cycles of the following: 

30 sec. 94°C, 
30 sec. 55°C, and 
30 sec. 72°C. 

At the end of the program, the reaction vial is removed from the thermal cycler and the aqueous 

30 phase transferred to a new vial, extracted with phenol/chloroform (50:50:vol), and ethanol precipitated, and 
the DNA is recovered by standard procedures. This material is subsequently subjected to the appropriate 
treatments for insertion into a vector. 

Another method for preparing variants, cassette mutagenesis, is based on the technique described by 
Wells et al. (Gene, 34: 315,1985). The starting material is the plasmid (or other vector) comprising HRG 

35 DNA to be mutated. The codon(s) in HRG DNA to be mutated are identified. There must be a unique 
restriction endonuclease site on each side of the identified mutation site(s). if no such restriction sites exist, 
they may be generated using the above-described oligonucieotide-mediated mutagenesis method to introduce 
them at appropriate locations in HRG DNA. After the restriction sites have been introduced into the plasmid, 
the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of 

40 the DNA between the restriction sites but containing the desired mutation(s) is synthesized using standard 
procedures. The two strands are synthesized separately and then hybridized together using standard 
techniques. This double-stranded oligonucleotide is referred to as the cassette. This cassette is designed to 
have 3' and 5* ends that are compatible with the ends of the linearized plasmid, such that it can be directly ^ t v „ , 
iigated to the plasmid. This plasmid now contains the mutated HRG DNA sequence. 

45 C. Insertion of DNA into a Cloning Vehicle 

The cDNA or genomic DNA encoding native or variant HRG is inserted into a repiicable vector for 
further cloning (amplification of the DNA) or for expression. Many vectors are available, and selection of 
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the appropriate vector will depend on 1) whether it is to be used for DNA amplification or for DNA 
expression, 2) the size of the DNA to be inserted into the vector, and 3) the host cell to be transformed with 
the vector. Each vector contains various components depending on its function (amplification of DNA or 
expression of DNA) and the host cell for which it is compatible. The vector components generally include, 
5 but are not limited to, one or more of the following: a signal sequence, an origin of replication, one or more 
marker genes, an enhancer element, a promoter, and a transcription termination sequence. 

(i) Signal Sequence Component 
In general, the signal sequence may be a component of the vector, or it may be a pan of HRG DNA 
that is inserted into the vector. The native HRG DNA is believed to encode a signal sequence at the amino 

10 terminus (5' end of the DNA encoding HRG) of the polypeptide that is cleaved during post-rranslational 
processing of the polypeptide to form the mature HRG polypeptide ligand that binds to the HER2/HER3 
receptor, although a conventional signal structure is not apparent. Native HRG is, secreted from the cell but 
remains lodged in the membrane because it contains a transmembrane domain and a cytoplasmic region in 
the carboxyl terminal region of the polypeptide. Thus, in a secreted, soluble version of HRG the carboxyl 

15 terminal domain of the molecule, including the transmembrane domain, is ordinarily deleted. This truncated 
variant HRG polypeptide may be secreted from the cell, provided that the DNA encoding the truncated 
variant encodes a signal sequence recognized by the host. 

HRG of this invention may be expressed not only directly, but also as a fusion with a heterologous 
polypeptide, preferably a signal sequence or other polypeptide having a specific cleavage site at the N-and/or 

20 C-terminus of the mature protein or polypeptide. In general the signal sequence may be a component of the 
vector, or it may be a part of HRG DNA that is inserted into the vector. Included within the scope of this 
invention are HRG with the native signal sequence deleted and replaced with a heterologous signal sequence. 
The heterologous signal sequence selected should be one that is recognized and processed, i.e., cleaved by a 
signal peptidase, by the host cell. For prokaryotic host cells that do not recognize and process the native 

25 HRG signal sequence, the signal sequence is substituted by a prokaryotic signal sequence selected, for 
example, from the group of the alkaline phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. 
For yeast secretion the native HRG signal sequence may be substituted by the yeast invertase, alpha factor, or 
acid phosphatase leaders. In mammalian cell expression the native signal sequence is satisfactory, although 
other mammalian signal sequences may be suitable. 

30 (ii> Origin of Replication Component 

Both expression and cloning vectors generally contain a nucleic acid sequence that enables the 
vector to replicate in one or more selected host cells. Generally, in cloning vectors this sequence is one that 
enables the vector to replicate independently of the host chromosomal DNA, and includes origins of 
replication or autonomously replicating sequences. Such sequences are well known for a variety of bacteria, 

35 yeast, and viruses. The origin of replication from the plasmid pBR322 is suitable for most Gram-negative 
bacteria, the 2m plasmid origin is suitable for yeast, and various viral origins (SV40, polyoma, adenovirus, 
VSV or BPV) are useful for cloning vectors in mammalian cells. Generally, the origin of replication 
component is not needed for mammalian expression vectors (the SV40 origin may typically be used only 
because it contains the early promoter). 

40 Most expression vectors are "shuttle" vectors, i.e., they are capable of replication in at least one 

class of organisms but can be transfected into another organism for expression. For example, a vector is 
cloned in E. coli and then the same vector is transfected into yeast or mammalian ceils for expression even 
though it is not capable of replicating independently of the host cell chromosome. 

DNA may also be amplified by insertion into the host genome. This is readily accomplished using 

45 Bacillus species as hosts, for example, by including in the vector a DNA sequence that is complementary to a 
sequence found in Bacillus genomic DNA. Transfection of Bacillus with this vector results in homologous 
recombination with the genome and insertion of HRG DNA. However, the recovery of genomic DNA 
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encoding HRG is more complex than that of an exogenously replicated vector because restriction enzyme 
digestion is required to excise HRG DNA. DNA can be amplified by PGR and directly transfecied into the 
host cells without any replication component. 

(iii) Selection Gene Component 

Expression and cloning vectors should contain a selection gene, also termed a selectable marker. 
This gene encodes a protein necessary for the survival or growth of transformed host cells grown in a 
selective culture medium. Host cells not transformed with the vector containing the selection gene will not 
survive in the culture medium. Typical selection genes encode proteins that (a) confer resistance to 
antibiotics or other toxins, e.g., ampicillin. neomycin, methotrexate, or tetracycline, (b) complement 
auxotrophic deficiencies, or (c) supply critical nutrients not available from complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. 

One example of a selection scheme utilizes a drug to arrest growth of a host cell. Those cells that 
are successfully transformed with a heterologous gene express a protein conferring drug resistance and thus 
survive the selection regimen. Examples of such dominant selection use the drugs neomycin (Southern et 
aL, J. Molec. Appl. Genet 1: 327.1982), mycophenolic acid (Mulligan et ai. Science 209: 1422,1980) or 
hygromycin (Sugden et ai. Mol. Cell. Biol. 5: 410-413,1985). The three examples given above employ 
bacterial genes under eukaryotic control to convey resistance to the appropriate drug G418 or neomycin 
(geneticin). xgpt (mycophenolic acid), or hygromycin, respectively. 

Another example of suitable selectable markers for mammalian cells are those that enable the 
identification of cells competent to take up HRG nucleic acid, such as dihydrofolate reductase (DHFR) or 
thymidine kinase. The mammalian cell transformants are placed under selection pressure which only the 
transformants are uniquely adapted to survive by virtue of having taken up the marker. Selection pressure is 
imposed by culturing the transformants under conditions in which the concentration of selection agent in the 
medium is successively changed, thereby leading to amplification of both the selection gene and the DNA 
that encodes HRG. Amplification is the process by which genes in greater demand for the production of a 
protein critical for growth are reiterated in tandem within the chromosomes of successive generations of 
recombinant cells. Increased quantities of HRG are synthesized from the amplified DNA. 

For example, cells transformed with the DHFR selection gene are first identified by culturing all of 
the transformants in a culture medium that contains methotrexate (Mtx), a competitive antagonist of DHFR. 
An appropriate host cell when wild-type DHFR is employed is the Chinese hamster ovary (CHO) cell line 
deficient in DHFR activity, prepared and propagated as described by Urlaub and Chasin, Proc. Natl. Acad 
Sci. USA, 77: 4216, 1980. The transformed cells are then exposed to increased levels of methotrexate. This 
leads to the synthesis of multiple copies of the DHFR gene, and, concomitantly, multiple copies of other 
DNA comprising the expression vectors, such as the DNA encoding HRG. This amplification technique can 
be used with any otherwise suitable host, e.g., ATCC No. CCL61 CHO-K1, notwithstanding the presence of 
endogenous DHFR if, for example, a mutant DHFR gene that is highly resistant to Mrx is employed (EP 
1 17,060). Alternatively, host cells (particularly wild-type hosts that contain endogenous DHFR) transformed 
or co-transformed with DNA sequences encoding HRG, wild-type DHFR protein, and another selectable 
marker such as aminoglycoside 3* phosphotransferase (APH) can be selected by cell growth in medium 
containing a selection agent for the selectable marker such as an aminoglycosidic antibiotic, e.g.. kanamycin, 
neomycin, or G418 (see U.S. Pat. No. 4,965,199). 

A suitable selection gene for use in yeast is the trp\ gene present in the yeast plasmid YRp7 
(Stinchcomb et at.. Nature, 282: 39, 1979; Kingsman et ai. Gene, 7: 141. 1979; or Tschemper et ai. Gene, 
10: 157, 1980). The trp\ gene provides a selection marker for a mutant strain of yeast lacking the ability to 
grow in tryptophan, for example, ATCC No. 44076 or PEP4-1 (Jones, Genetics, 85: 12, 1977). The presence 
of the trp\ lesion in the yeast host cell genome then provides an effective environment for detecting 
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transformation by growth in the absence of tryptophan. Similarly, Leu2-deficient yeast strains (ATCC 
20,622 or 38.626) are complemented by known piasmids bearing the Leu2 gene. 

(iv) Promoter Component 

Expression and cloning vectors usually contain a promoter that is recognized by the host organism 
and is operably linked to HRG nucleic acid. Promoters are untranslated sequences located upstream (5') to 
the start codon of a structural gene (generally within about 100 to 1000 bp) that control the transcription and 
translation of a particular nucleic acid sequence, such as HRG to which they are operably linked. Such 
promoters typically fall into two classes, inducible and constitutive. Inducible promoters are promoters that 
initiate increased levels of transcription from DNA under their control in response to some change in culture 
conditions, e.g., the presence or absence of a nutrient or a change in temperature. At this time a large 
number of promoters recognized by a variety of potential host cells are well known. These promoters are 
operably linked to DNA encoding HRG by removing the promoter from the source DNA by restriction 
enzyme digestion and inserting the isolated promoter sequence into the vector. Both the native HRG 
promoter sequence and many heterologous promoters may be used to direct amplification and/or expression 
of HRG DNA. However, heterologous promoters are preferred, as they generally permit greater 
transcription and higher yields of expressed HRG as compared to the native HRG promoter. 

Promoters suitable for use v/ith prokaryotic hosts include the b-!actamase and lactose promoter 
systems (Chang et a/.. Nature, 275: 615, 1978; and Goeddel et al. t Nature 281: 544, 1979), alkaline 
phosphatase, a tryptophan (trp) promoter system (Goeddel. Nucleic Acids Res., 8: 4057, 1980 and EP 
36,776), tPA (U.S. 5,641,655) and hybrid promoters such as the tac promoter (deBoer et a/., Proc. Natl. 
Acad Sci. USA 80: 21-25. 1983). However, other known bacterial promoters are suitable. Their nucleotide 
sequences have been published, thereby enabling a skilled worker operably to ligate them to DNA encoding 
HRG (Siebenlist et al. y Cell 20: 269, 1980) using linkers or adaptors to supply any required restriction sites. 
Promoters for use in bacterial systems also generally will contain a Shine-Dalgarno (S.D.) sequence operably 
linked to the DNA encoding HRG. 

Suitable promoting sequences for use with yeast hosts include the promoters for 3-phosphoglycerate 
kinase (Hitzeman et aL J. Biol. Chem. t 255: 2073, 1980) or other glycolytic enzymes (Hess et ai. J. Adv. 
Enzyme Reg 7: M9, 1968; and Holland, Biochemistry 17: 4900, 1978), such as enolase, glyceraldehyde ?- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, giucose-6-phosphate 
isomerase. 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose 
isomerase. and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of 
transcription controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with nitrogen metabolism, 
metallothionein, glyceraldehyde-3-phosphate dehydrogenase, and enzymes responsible for maltose and 
galactose utilization. Suitable vectors and promoters for use in yeast expression are further described in 
Hitzeman et al. % EP 73,657A. Yeast enhancers also are advantageously used with yeast promoters. 

Promoter sequences are known for eukaryotes. Virtually all eukaryotic genes have an AT-rich 
region located approximately 25 to 30 bases upstream from the site where transcription is initiated. Another 
sequence found 70 to 80 bases upstream from the start of transcription of many genes is a CXCAAT region 
where X may be any nucleotide. At the 3' end of most eukaryotic genes is an AATAAA sequence that may 
be the signal for addition of the poly A tail to the 3' end of the coding sequence. All of these sequences are 
suitably inserted into mammalian expression vectors. 

HRG gene transcription from vectors in mammalian host cells may be controlled by promoters 
obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1,504, published 5 July 
1989), adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus, hepatitis-B virus and most preferably Simian Virus 40 (SV40) t from heterologous mammalian 
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promoters, e.g.. the actin promoter or an immunoglobulin promoter, from heat-shock promoters, and from 
the promoter normally associated with HRG sequence, provided such promoters are compatible with the host 
cell systems. 

The early and late promoters of the SV40 virus are conveniently obtained as an SV40 restriction 
fragment that also contains the SV40 viral origin of replication (Fiers et ai. Nature, 273:113 (1978); 
Mulligan and Berg, Science. 209: 1422-1427 (1980); Pavlakis et ai, Proc. Nati Acad. Sci USA. 78: 7398- 
7402 (1981)). The immediate early promoter of the human cytomegalovirus is conveniently obtained as a 
Hind\\\ E restriction fragment (Greenaway et ai. Gene, 18: 355-360 (1982)). A system for expressing DNA 
in mammalian hosts using the bovine papilloma virus as a vector is disclosed in U.S. 4,419,446. A 
modification of this system is described in U.S. 4.601,978. See also Gray et ai. Nature, 295: 503-508 
(1982) on expressing cDNA encoding immune interferon in monkey cells; Reyes et ai. Nature, 297: 598- 
601 (1982) on expression of human b-interferon cDNA in mouse cells under the control of a thymidine 
kinase promoter from herpes simplex virus; Canaani and Berg, Proc, Natl. Acad. Sci. USA, 79: 5166-5170 
(1982) on expression of the human interferon bl gene in cultured mouse and rabbit cells; and Gorman et ai, 
Proc Natl. Acad. Sci USA, 79: 6777-6781 (1982) on expression of bacterial CAT sequences in CV-1 
monkey kidney cells, chicken embryo fibroblasts, Chinese hamster ovary cells, HeLa cells, and mouse NIH- 
3T3 cells using the Rous sarcoma virus long terminal repeat as a promoter. 

(v) Enhancer Element Component 

Transcription of a DNA encoding HRG of this invention by higher eukaryotes is often increased by 
inverting an enhancer sequence into the vector. Enhancers are cis-acting elements of DNA ; usually about 
from 10-300 bp, that act on a promoter to increase its transcription. Enhancers are relatively orientation and 
position independent having been found 5' (Laimins et ai, Proc. Natl. Acad. Sci USA, 78: 903, 1981) and 3' 
<Lusk\ et ai, Moi Cell Bio.. 3: 1 108, 1983) to the transcription unit, within an intron (Banerji et ai. Cell, 
33. 72 l >. 1983) as well as within the coding sequence itself (Osborne et ai, Moi Cell Bio., 4: 1293. 1984). 
Many enhancer sequences are now known from mammalian genes (globin. elastase, albumin, a-fetoprotein 
and insulin). Typically, however, one will use an enhancer from a eukaryotic cell virus. Examples include 
the SV40 enhancer on the late side of the replication origin (bp 100-270), the cytomegalovirus early 
promoter enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus enhancers 
(see also Yaniv, Nature, 297: 17-18 (1982)) on enhancing elements for activation of eukaryotic promoters. 
The enhancer may be spliced into the vector at a position 5* or 3' to HRG DNA, but is preferably located at a 
Mtc 5' from the promoter. 

(vi) Transcription Termination Component 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or 
nucleated cells from other multicellular organisms) will also contain sequences necessary for the termination 
of transcription and for stabilizing the mRNA. Such sequences are commonly available from the 5' and, 
occasionally 3' untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain 
nucleotide segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA 
encoding HRG. The 3* untranslated regions also include transcription termination sites. 

Construction of suitable vectors containing one or more of the above listed components the desired 
coding and control sequences employs standard ligation techniques. Isolated plasmids or DNA fragments 
are cleaved, tailored, and religated in the form desired to generate the plasmids required. 

For analysis to confirm correct sequences in plasmids constructed, the ligation mixtures are used to 
transform £. coli K12 strain 294 (ATCC 31,446) and successful transformants selected by ampicillin or 
tetracycline resistance where appropriate. Plasmids from the transformants are prepared, analyzed by 
restriction endonuclease digestion, and/or sequenced by the method of Messing et ai. Nucleic Acids Res. 9: 
309 (1981) or by the method of Maxam et ai, Methods in Enzymology 65: 499 (1980). 
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Particularly useful in the practice of this invention are expression vectors that provide for the 
transient expression in mammalian ceils of DNA encoding HRG. In general, transient expression involves 
the use of an expression vector that is able to replicate efficiently in a host cell, such that the host cell 
accumulates many copies of the expression vector and, in turn, synthesizes high levels of a desired 
5 polypeptide encoded by the expression vector. Transient expression systems, comprising a suitable 
expression vector and a host cell, allow for the convenient positive identification of polypeptides encoded by 
cloned DNAs. as well as for the rapid screening of such polypeptides for desired biological or physiological 
properties. Thus, transient expression systems are particularly useful in the invention for purposes of 
identifying analogs and variants of HRG that have HRG-like activity. Such a transient expression system is 

1 0 described in U.S. 5,024,939. 

Other methods, vectors, and host cells suitable for adaptation to the synthesis of HRG in 
recombinant vertebrate cell culture are described in Gething et aL* Nature 293: 620-625, 1981; Mantei et at., 
Nature, 281: 40-46, 1979; Levinson et al., EP 1 17,060 and EP 1 17,058. A particularly useful expression 
plasmid for mammalian cell culture expression of HRG is pRK5 (EP pub. no. 307,247). 

15 D. Selection and Transformation of Host Cells 

Suitable host cells for cloning or expressing the vectors herein are the prokaryote, yeast, or higher 
eukaryote cells described above. Suitable prokaryotes include eubacteria, su=h as Gram-negative or Gram- 
positive organisms, for example, E. coli. Bacilli such as B. subtilis. Pseudomonas species such as P. 
aeruginosa. Salmonella ryphimurium, or Serratia marcescans. One preferred E. coli cloning host is £. coli 

20 294 (ATCC 31,446), although other strains such as E. coli B, E. coli x \176 (ATCC 31,537), and E. coli 
W3 1 10 (ATCC 27.325) are suitable. These examples are illustrative rather than limiting. Preferably the host 
cell should secrete minimal amounts of proteolytic enzymes. Alternatively, in vitro methods of cloning, e.g., 
PCR or other nucleic acid polymerase reactions, are suitable. 

In addition to prokaryotes, eukaryotic microbes such as filamentous fungi or yeast are suitable hosts 

25 for HRG-encoding vectors. Saccharomyces cerevisiae, or common baker's yeast, is the most commonly used 
among lower eukaryotic host microorganisms. However, a number of other genera, species, and strains are 
commonly available and useful herein, such as Schizosaccharomyces pombe (Beach and Nurse, Nature. 290: 
140 (1981); EP 139,383, published May 2, 1985), Kluy\>eromyces hosts (U.S. S.N. 4.943,529) such as, e.g., 
K. lactis (Louvencourt et al, J. Bacterial., 737 (1983); K. /rag i I is. K. bulgaricus, K. thermototerans, and K. 

30 marxianus, yarrowia (EP 402.226); Pichia pastoris (EP 1 83,070). Sreekrishna ei at., J. Basic Microbiol., 28: 
265-278 (1988); Candida, Trichoderma reesia (EP 244,234); Neurospora crassa (Case et at., Proc. Natl. 
Acad. Sci. USA, 76: 5259-5263 (1979), and filamentous fungi such as. e.g, Neurospora, Penicillium. 
Tolypocladium (WO 91/00357, published 10 January 1991), and Aspergillus hosts such as A. nidulans 
(Ballance et at., Biochem. Biophys. Res. Commun., 112: 284-289 (1983); Tilbum et aL Gene, 26: 205-221 

35 (1983); Yelton et at., Proc. Natl. Acad. Sci USA, 81: 1470-1474 (1984) and A. niger (Kelly and Hynes, 
EM BO J., 4: 475-479 (1985)). 

Suitable host cells for the expression of glycosylated HRG polypeptide are derived from 
multicellular organisms. Such host cells are capable of complex processing and glycosylation activities. In 
principle, any higher eukaryotic cell culture is workable, whether from vertebrate or invertebrate culture. 

40 Examples of invertebrate cells include plant and insect ceils. Numerous baculoviral strains and variants and 
corresponding permissive insect host cells from hosts such as Spodoptera frugiperda (caterpillar), Aedes 
aegypti (mosquito), Aedes albopictus (mosquito), Drosophila meianogaster (fruitfly), and Bombyx mori host 
cells have been identified (see, e.g., Luckow et al., Bio/Technology, 6: 47-55 (1988); Miller et at., in Genetic 
Engineering, Setlow, J.K. et al, eds., VoL 8 (Plenum Publishing, 1986), pp. 277-279; and Maeda et al. % 

45 Nature, 315: 592-594 (1985)). A variety of such viral strains are publicly available, e.g., the L-l variant of 
Autographa californica NPV and the Bm-5 strain of Bombyx mori NPV, and such viruses may be used as the 
virus herein according to the present invention, particularly for transfection of Spodoptera frugiperda cells. 

30 
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Plant cell cultures of cotton, corn, potato, soybean, petunia, tomato, and tobacco can be utilized as hosts. 
Typically, plant cells are transfected by incubation with certain strains of the bacterium Agrobacterium 
tumefaciens. which has been previously manipulated to contain HRG DNA. During incubation of the plant 
cell culture with A, tumefaciens, the DNA encoding HRG is transferred to the plant cell host such that it is 
5 transfected, and will, under appropriate conditions, express HRG DNA. In addition, regulatory and signal 
sequences compatible with plant cells are available, such as the nopaline synthase promoter and 
polyadenylation signal sequences (Depicker et a/., J, MoL Appl. Gen., 1: 561 (1982)). in addition, DNA 
segments isolated from the upstream region of the T-DNA 780 gene are capable of activating or increasing 
transcription levels of plant-expressible genes in recombinant DNA-containing plant tissue (see EP 321,196, 
10 published 21 June 1989). 

However, interest has been greatest in vertebrate cells, and propagation of vertebrate ceils in culture 
(tissue culture) has become a routine procedure in recent years (Tissue Culture. Academic Press, Kxuse and 
Patterson, editors (1973)). Examples of useful mammalian host cell lines are monkey kidney CV1 line 
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line (293 or 293 cells subcloned 
15 for growth in suspension culture, Graham et aL, J. Gen ViroL, 36: 59, 1977); baby hamster kidney cells 
(BHK, ATCC CCL 10); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad 
Sci. USA, 77: 4216 (1980)); mouse Sertoli cells (TM4, Mather. Biol. Reprod.. 23: 243-251 (1980)); monkey 
kidney cells (CV1 ATCC CCL 70); African green monkey kidney cells (VERO-76. ATCC CRL- 1587); 
human cervical carcinoma cells (HELA. ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); 
20 buffalo rat liver cells (BRL 3 A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); human liver 
ceils (Hep G2. HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRJ cells (Mather et aL, 
Annals N.Y. Acad. Set., 383: 44-68 (1982)); MRC 5 cells; FS4 cells: r.nd a human hepatoma cell line (Hep 
G2). Preferred host cells are human embryonic kidney 293 and Chinese hamster ovary ceils. 

Host cells are transfected and preferably transformed with the above-described expression or 
25 cloning vectors of this invention and cultured in conventional nutrient media modified as appropriate for 
inducing promoters, selecting trans formants, or amplifying the genes encoding the desired sequences. 

Transfection refers to the taking up of an expression vector by a host ceil whether or not any coding 
sequences are in fact expressed. Numerous methods of transfection arc known to the ordinarily skilled 
artisan, for example, CaPC>4 and electroporation. Successful transfection is generally recognized when any 

30 indication of the operation of this vector occurs within the host cell. 

Transformation means introducing DNA into an organism so that the DNA is replicable. either as an 
extrachromosoma! element or by chromosomal integration. Depending on the host cell used, transformation 
is done using standard techniques appropriate to such cells. The calcium treatment employing calcium 
chloride, as described in section 1.82 of Sambrook et aL. supra, is generally used for prokaryotes or other 

35 cells that contain substantial cell-wall barriers. Infection with Agrobacterium tumefaciens is used for 
transformation of certain plant cells, as described by Shaw et aL, Gene, 23: 315 (1983) and WO 89/05859, 
published 29 June 1989. For mammalian cells without such cell walls, the calcium phosphate precipitation 
method described in sections 16.30-16.37 of Sambrook et aL supra, is preferred. General aspects of 
mammalian cell host system transformations have been described by Axel in U.S. 4,399,216. issued 16 

40 August 1983. Transformations into yeast are typically carried out according to the method of Van Solingen 
et aL. J. Bact. % 130: 946 (1977) and Hsiao et aL, Proc. Natl. Acad. Sci. (USA), 76: 3829 (1979). However, 
other methods for introducing DNA into cells such as by nuclear injection, electroporation, or protoplast 
fusion may also be used. 

E. Culturing the Host Cells 

45 Prokaryotic cells used to produce HRG polypeptide of this invention are cultured in suitable media 

as described generally in Sambrook et al., supra. 
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The mammalian host cells used to produce HRG of this invention may be cultured in a variety of 
media. Commercially available media such as Ham's FIO (Sigma), Minimal Essential Medium ((MEM), 
Sigma), RPM1-1640 (Sigma), and Dulbecco's Modified Eagle's Medium ((DMEM), Sigma) are suitable for 
culruring the host cells. In addition, any of the media described in Ham and Wallace, Meth. Em.. 58: 44 
5 (1979), Barnes and Sato, Anal Biochem^ 102: 255 (1980), U.S. 4.767.704; 4,657,866; 4,927,762; or 
4,560,655; WO 90/03430; WO 87/00195; U.S. Pat. Re. 30,985; or U.S. 5,122,469 may be used as culture 
media for the host cells. Any of these media may be supplemented as necessary with hormones and/or other 
growth factors (such as insulin. Transferrin, or epidermal growth factor), salts (such as sodium chloride, 
calcium, magnesium, and phosphate), buffers (such as HEPES), nucleosides (such as adenosine and 

10 thymidine), antibiotics (such as GENTAMYCIN drug), trace elements (defined as inorganic compounds 
usually present at final concentrations in the micromolar range), and glucose or an equivalent energy source. 
Any other necessary supplements may also be included at appropriate concentrations that would be known to 
those skilled in the an. The culture conditions, such as temperature, pH, and the like, are those previously 
used with the host cell selected for expression, and will be apparent to the ordinarily skilled artisan. 

15 The host cells referred to in this disclosure encompass cells in in vitro culture as well as cells that 

are within a host animal. 

It is further envisioned that HRG of this invention may be produced by homologous recombination, 
or with recombinant production methods utilizing control elements introduced into cells already containing 
I )N A encoding HRG currently in use in the field. For example, a powerful promoter/enhancer element, a 
:n suppresses or an exogenous transcription modulatory element is inserted in the genome of the intended host 
cell in proximity and orientation sufficient to influence the transcription of DNA encoding the desired HRG. 
I he conirol element does not encode HRG of this invention, but the DNA is present in the host cell genome. 
( me ne\i screens for cells making HRG of this invention, or increased or decreased levels of expression, as 
desired 

25 F. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting, Northern blotting to quantitate the transcription of mRNA (Thomas, Proc. 
Natl Acad Sci USA, 11: 5201-5205 (1980)), dot blotting (DN.* analysis), or in situ hybridization, using an 
appropriately labeled probe based on the sequences provided herein. Various labels may be employed, most 

30 commonly radioisotopes, particularly 3 -P. However, other techniques may also be employed, such as using 
bionn-modified nucleotides for introduction into a polynucleotide. The biotin then serves as the site for 
binding to avidin or antibodies which may be labeled with a wide variety of labels, such as radionuclides, 
fluoresces, enzymes, or the like. Alternatively, antibodies may be employed that can recognize specific 
duplexes, including DNA duplexes, RNA duplexes, and DNA-RNA hybrid duplexes or DNA-protein 

35 duplexes. The antibodies in turn may be labeled and the assay may be carried out where the duplex is bound 
to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the 
duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, such as 
immunohistochemical staining of tissue sections and assay of cell culture or body fluids, to quantitate 

40 directly the expression of gene product. With immunohistochemical staining techniques, a cell sample is 
prepared, typically by dehydration and fixation, followed by reaction with labeled antibodies specific for the 
gene product coupled where the labels are usually visually detectable such as enzymatic labels, fluorescent 
labels, luminescent labels, and the like. A particularly sensitive staining technique suitable for use in the 
present invention is described by Hsu et al, Am. J. Clin. Path., 75: 734-738 (1980). 

45 Antibodies useful for immunohistochemical staining and/or assay of sample fluids may be either 

monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the antibodies may be 
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prepared against a native HRG polypeptide or against a synthetic peptide based on the DNA sequences 
provided herein as described further below. 

G. Purification of The Heregulin Polypeptides 
HRG is recovered from a cellular membrane fraction. Alternatively, a proteoiytically cleaved or a 
5 truncated expressed soluble HRG fragment or subdomain are recovered from the culture medium as a soluble 
polypeptide. A HRG is recovered from host cell lysates when directly expressed without a secretory signal. 

When HRG is expressed in a recombinant cell other than one of human origin, HRG is completely 
free of proteins or polypeptides of human origin. However, it is desirable to purify. HRG from recombinant 
cell proteins or polypeptides to obtain preparations that are substantially homogeneous as to HRG. As a first 
10 step, the culture medium or lysate is centrifuged to remove particulate cell debris. The membrane and 
soluble protein fractions are then separated. HRG is then be purified from both the soluble protein fraction 
(requiring the presence of a protease) and from the membrane fraction of the culture lysate, depending on 
whether HRG is membrane bound. The following procedures are exemplary of suitable purification 
procedures: fractionation on immunoaffinity or ion-exchange columns; ethanoi precipitation; reverse phase 
15 HPLC; chromatography on silica, heparin SEPHAROSE or on a cation exchange resin such as DEAE; 
chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; and gel filtration using, for example, 
SEPHADEX G-75. 

HRG variants in which residues have been deleted, inserted or substituted are recovered in the same 
fashion as the native HRG, taking account of any substantial changes in properties occasioned by the 

20 variation. For example, preparation of a HRG fusion with another protein or polypeptide, e.g., a bacterial or 
viral antigen, facilitates purification: an immunoafTinity column containing antibody to the antigen can be 
used to adsorb the fusion. Immunoaffmity coiumns such as a rabbit polyclonal anti-HRG column can be 
employed to absorb HRG variant by binding it to at least one remaining immune epitope. A protease 
inhibitor such as phenylmethylsulfonylfluoride (PMSF) also may be useful to inhibit proteolytic degradation 

25 during purification, and antibiotics may be included to prevent the growth of adventitious contaminants. One 
skilled in the art will appreciate that purification methods suitable for native HRG may require modification 
to account for changes in the character of HRG variants or upon expression in recombinant cell culture. 
. H. Covalent Modifications of HRG 
Covalent modifications of HRG polypeptides are included within the scope of this invention. Both 

30 native HRG and amino acid sequence variants of HRG optionally are covalently modified. One type of 
covalent modification included within the scope of this invention is a HRG polypeptide fragment. HRG 
fragments, such as HRG-GDF, having up to about 40 amino acid residues are conveniently prepared by 
chemical synthesis, or by enzymatic or chemical cleavage of the full-length HRG polypeptide or HRG 
variant polypeptide. Other types of covalent modifications of HRG or fragments thereof are introduced into 

35 the molecule by reacting targeted amino acid residues of HRG or fragments thereof with an organic 
derivatizing agent that is capable of reacting with selected side chains or the N- or C-terminal residues. 

Cysteinyl residues most commonly are reacted with a-haloacetates (and corresponding amines), 
such as chloroacetic acid or chloroacetamide, to give carboxymethyl or carboxyamidomethyl derivatives. 
Cysteinyl residues also are derivatized by reaction with bromotrifluoroacetone, a-bromo-P-(5- 

40 imidozoyl)propionic acid, chloroacetyl phosphate, N-alkylmaleimides, 3-nitro-2-pyridyl disulfide, methyl 2- 
pyridyl disulfide, p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, or chloro-7-nitrobenzo-2-oxa- 
1,3-diazole. 

Histidyl residues are derivatized by reaction with diethylpyrocarbonate at pH 5.5-7.0 because this * 
agent is relatively specific for the histidyl side chain. Para-bromophenacy! bromide also is useful; the 
45 reaction is preferably performed in 0. 1 M sodium cacodylate at pH 6.0. 
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Lysinyl and amino terminal residues are reacted with succinic or other carboxylic acid anhydrides. 
Derivatization with these agents has the effect of reversing the charge of the lysinyl residues. Other suitable 
reagents for derivatizing a-amino-containing residues include imidoesters such as methyl picolinimidate; 
pyridoxal phosphate; pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid; O-methylisourea; 2,4- 
5 pentanedione; and transaminase-catalyzed reaction with giyoxylate. 

Arginyl residues are modified by reaction with one or several conventional reagents, among them 
phenylglyoxal. 2,3-butanedione, 1,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residues 
requires that the reaction be performed in alkaline conditions because of the high pK a of the guanidine 
functional group. Furthermore, these reagents may react with the groups of lysine as well as the arginine 

10 epsilon-amino group. 

The specific modification of ryrosyl residues may be made, with particular interest in introducing 
spectral labels into tyrosyl residues by reaction with aromatic diazonium compounds or tetranitromethane. 
Most commonly, N-acetyiimidizoie and tetranitromethane are used to form O-acetyi tyrosyl species and 3- 
nitro derivatives, respectively. Tyrosyl residues are iodinated using or ^Ij to prepare labeled proteins 

1 5 for use in radioimmunoassay, the chloramine T method described above being suitable. 

Carboxyl side groups (aspartyl or glutamyl) are selectively modified by reaction with carbodiimides 
(R'-N=ON-R'), where R and R' are different alkyl groups, such as l-cyciohexyl-3-(2-morphoiinyl-4-ethyl) 
carbodiimide or l-ethyl-3-(4-azonia-4.4-dimethylpentyl) carbodiirnide. Furthermore, aspartyl and glutamyl 
residues are converted to asparaginyl and glutaminyl residues by reaction with ammonium ions. 

20 Derivatization with bifunctional agents is useful for crosslinking HRG to a water-insoluble support 

matrix or surface for use in a method for purifying anti-HRG antibodies, and vice versa. Commonly used 
crosslinking agents include, e.g., U-bis(diazoaceryI)-2-phenylethane. glutaraldehyde, N- 
hydroxysuccinimide esters, for example, esters with 4-azidosalicylic acid, homobifunctional imidoesters, 
including disuccinimidyl esters such as 3,3'-dithiobis(succinimidylpropionate), and bifunctional maleimides 

25 such as bis-N-maleimido-1 ,8-octane. Derivatizing agents such as rnethyl-3-((p- 
azidophenyi)dithio)propioimidate yield photoactivatable intermediates that are capable of forming crosslinks 
in the presence of light. Alternatively, reactive water-insoluble matrices such as cyanogen bromide-activated 
carbohydrates and the reactive substrates described in U.S. 3,969.287; 3,691.016; 4,195,128; 4,247,642; 
4,229,537; and 4,330,440 are employed for protein immobilization. 

30 Glutaminyl and asparaginyl residues are frequently deamidated to the corresponding glutamyl and 

aspartyl residues, respectively. Alternatively, these residues are deamidated under mildly acidic conditions. 
Either form of these residues falls within the scope of this invention. 

Other modifications include hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyi residues, methyiation of the a-amino groups of lysine, arginine. and histidine side 

35 chains (T.E. Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)), acetylation of the N-terminai amine, and amidation of any C-terminal carboxyl group. 

HRG optionally is fused with a polypeptide heterologous to HRG. The heterologous polypeptide 
optionally is an anchor sequence such as that found in a phage coat protein such as M13 gene III or gene 
VIII proteins. These heterologous polypeptides can be covalently coupled to HRG polypeptide through side 

40 chains or through the terminal residues. 

HRG may also be covalently modified by altering its native glycosylation pattern. One or more 
carbohydrate substituents in these embodiments, are modified by adding, removing or varying the 
monosaccharide components at a given site, or by modifying residues in HRG as that glycosylation sites are 
added or deleted. 

45 Glycosylation of polypeptides is typically either N-linked or O-linked. N-linked refers to the 

attachment of the carbohydrate moiety to the side chain of an asparagine residue. The tri-peptide sequences 
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asparagine-X-serine and asparagine-X-threonine, where X is any amino acid except proline, are the 
recognition sequences for enzymatic attachment of the carbohydrate moiety to the asparagine side chain. 
Thus, the presence of either of these tri-peptide sequences in a polypeptide creates a potential glycosylation 
site. O-linked glycosylation refers to the attachment of one of the sugars N-acetylgalactosamine, galactose, 
5 or xylose, to a hydroxyamino acid, most commonly serine or threonine, although 5-hydroxyproline or 5- 
hydroxylysine may also be used. 

Glycosylation sites are added to HRG by altering its amino acid sequence to contain one or more of 
the above-described tri-peptide sequences (for N-linked glycosylation sites). The alteration may also be 
made by the addition of, or substitution by, one or more serine or threonine residues to HRG (for O-linked 
10 glycosylation sites). For ease, HRG is preferably altered through changes at the DNA level, particularly by 
mutating the DNA encoding HRG at preselected bases such that codons are generated that will translate into 
the desired amino acids. 

Chemical or enzymatic coupling of glycosides to HRG increases the number of carbohydrate 
substituents. These procedures are advantageous in that they do not require production of the polypeptide in 

15 a host cell that is capable of N- and O-linked glycosylation. Depending on the coupling mode used f the 
sugar(s) may be attached to (a) arginine and histidine, (b) free carboxyl groups, (c) free sulfhydryl groups 
such as those of cysteine, (d) free hydroxyl groups such as those of serine, threonine, or hydroxyproline, (e) 
aromatic residues such as those of phenylalanine, tyrosine, or tryptophan, or (0 the amide group of 
glutamine. These methods are described in WO 87/05330, published 1 1 September 1987, and in Aplin and 

20 Wriston (CRC Crit. Re\>. Biochem,, pp. 259-306 (1981 )). 

Carbohydrate moieties present on an HRG also are removed chemically or enzymatically. Chemical 
deglycosylation requires exposure of the polypeptide to the compound trifluoromethanesulfonic acid, or an 
equivalent compound. This treatment results in the cleavage of most or ail sugars except the linking sugar 
(N-acetylglucosamine or N-acetylgalactosamine), while leaving the polypeptide intact. Chemical 

25 deglycosylation is described by Hakimuddin et al. {Arch. Biochem. Biophys., 259:52 (1987)) and by Edge et 
aL {Anal. Biochem., 1 18:131 (1981)). Carbohydrate moieties are removed from HRG by a variety of endo- 
and exo- glycosidases as described by Thotakura et al. (Meth. EmymoL, 138:350 (1987)). 

Glycosylation also is suppressed by tunicamycin as described by Duskin et al. {J. Biol. Chem., 
257:3 105 (1982)). Tunicamycin blocks the formation of protein-N-glycoside linkages. 

30 HRG may also be modified by linking HRG to various nonproteinaceous polymers, e.g., 

polyethylene glycol, polypropylene glycol or polyoxyalkylenes. in the manner set forth in U.S. 4,640,835; 
4,496,689; 4,30 1 ,144; 4,670,4 1 7; 4,79 1,192 or 4, 1 79,337. 

One preferred way to increase the in vivo circulating half life of non-membrane bound HRG is to 
conjugate it to a polymer that confers extended half-life, such as polyethylene glycol (PEG). (Maxfield, et 

35 aL Polymer 16,505-509 (1975); Bailey, F. E., et aL in Nonionic Surfactants (Schick, M. J., ed.) pp. 794-821, 
1967); (Abuchowski, A. et al, J. Biol. Chem. 252, 3582-3586, 1977; Abuchowski, A. et aL, Cancer 
Biochem. Biophys. 7, 175-186, 1984); (Katre, N.V. et aL, Proc. Natl. Acad. Set., 84, 1487-1491, 1987; 
Goodson, R. et aL Bio Technology, 8, 343-346, 1990). Conjugation to PEG also has been reported to have 
reduced immunogenicity and toxicity (Abuchowski, A. etaL,J. Biol Chem., 252, 3578-358 1, 1977). 

40 HRG may also be entrapped in microcapsules prepared, for example, by coacervation techniques or 

by interfacia! polymerization (for example, hydroxymethylcellulose or gelatin-microcapsules and poly- 
(methyimethacylate) microcapsules, respectively), in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions, nano-particles and nanocapsules), or in macroemulsions. 
Such techniques are disclosed in Remington's Pharmaceutical Sciences, 16th edition, Osol, A., Ed., (1980). 

45 Those skilled in the art will be capable of screening variants in order to select the optimal variant for 

the purpose intended. For example, a change in the immunological character of HRG, such as a change in 
affinity for a given antigen or for the HER2 receptor, is measured by a competitive-type immunoassay using 
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a standard or control such as a native HRG (in particular native HRG-GFD). Other potential modifications 
of protein or polypeptide properties such as redox or thermal stability, hydrophobic ity, susceptibility to 
proteolytic degradation, stability in recombinant cell culture or in plasma, or the tendency to aggregate with 
carriers or into multimers are assayed by methods well known in the art. 
I. Heregulin Antibody Preparation 
The antibodies of this invention are obtained by routine screening and include polyclonal 
antibodies, monoclonal antibodies and fragments thereof. 

Polyclonal antibodies are preferably raised in animals by multiple subcutaneous (sc) or intraperitoneal 
(ip) injections of the relevant antigen and an adjuvant. It may be useful to conjugate the relevant antigen to a 
protein that is immunogenic in the species to be immunized, e.g., keyhole limpet hemocyanin, serum 
albumin, bovine thyroglobulin. or soybean trypsin inhibitor using a Afunctional or derivatizing agent, for 
example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N- 
hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, SOCI 2 , or r'n^ONR, 

where R and R 1 are different alkyl groups. 

Animals are immunized against the antigen, immunogenic conjugates, or derivatives by combining, 
e.g., 100 u,g or 5 fig of the protein or conjugate (for rabbits or mice, respectively) with 3 volumes of Freund's 
complete adjuvant and injecting the solution intradermally at multiple sites. One month later the animals are 
boosted with 1/5 to 1/10 the original amount of peptide or conjugate in Freund's complete adjuvant by 
subcutaneous injection at multiple sites. Seven to 14 days later the animals are bled and the serum is assayed 
for antibody titer. Animals are boosted until the titer plateaus. Preferably, the animal is boosted with the 
conjugate of the same antigen, but conjugated to a different protein and/or through a different cross-linking 
reagent. Conjugates also can be made in recombinant cell culture as protein fusions. Also, aggregating 
agents such as alum are suitably used to enhance the immune response. 

Monoclonal antibodies may be made using the hybridoma method first described by Kohler et ai, 
Nature, 256:495 (1975), or may be made by recombinant DNA methods (U.S. Patent No. 4,816,567). 

In the hybridoma method, a mouse or other appropriate host animal, such as a hamster or macaque 
monkey, is immunized as hereinabove described to elicit lymphocytes that produce or are capable of 
producing antibodies that will specifically bind to the protein used for immunization. Alternatively, 
lymphocytes may be immunized in vi/ro. Lymphocytes then are fused with myeloma cells using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: 
Principles and Practice, pp.59- 103 (Academic Press, 1986)). 

The hybridoma cells thus prepared are seeded and grown in a suitable culture medium that preferably 
contains one or more substances that inhibit the growth or survival of the unfused. parental myeloma cells. 
For example, if the parental myeloma cells lack the enzyme hypoxanthine guanine phosphoribosyl 
transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, 
aminopterin, and thymidine (HAT medium), which substances prevent the growth of HGPRT-deficient cells. 

Preferred myeloma cells are those that fuse efficiently, support stable high-level production of antibody 
by the selected antibody-producing ceils, and are sensitive to a medium such as HAT medium. Among 
these, preferred myeloma cell lines are murine myeloma lines, such as those derived from MOP-21 and 
M.C.-ll mouse rumors available from the Salk Institute Cell Distribution Center, San Diego, California 
USA, and SP-2 or X63-Ag8-653 cells available from the American Type Culture Collection, Rockville, 
Maryland USA. Human myeloma and mouse-human heteromyeloma cell lines also have been described for 
the production of human monoclonal antibodies (Kozbor, J. Immunol.. 133:3001 (1984); Brodeur et ai. 
Monoclonal Antibody Production Techniques and Applications, pp. 51-63 (Marcel Dekker, inc.. New York, 
1987)). 
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Culture medium in which hybridoma cells are growing is assayed for production of monoclonal 
antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies 
produced by hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). 

The binding affinity of the monoclonal antibody can. for example, be determined by the Scatchard 
analysis of Munson et aL, Anal. Biochem., 107:220 ( 1 980). 

After hybridoma cells are identified that produce antibodies of the desired specificity, affinity, and/or 
activity, the clones may be subcloned by limiting dilution procedures and grown by standard methods 
(Goding, Monoclonal Antibodies: Principles and Practice, pp.59- 103 (Academic Press. 1986)). Suitable 
culture media for this purpose include, for example, D-MEM or RPM1-1640 medium. In addition, the 
hybridoma cells may be grown in vivo as ascites tumors in an animal. 

The monoclonal antibodies secreted by the subclones are suitably separated from the culture medium, 
ascites fluid, or serum by conventional immunoglobulin purification procedures such as. for example, protein 
A-SEPHAROSE, hydroxy lapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

DNA encoding the monoclonal antibodies is readily isolated and sequenced using conventional 
procedures {e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding 
the heavy and light chains of the monoclonal antibodies). The hybridoma cells serve as a preferred source of 
such DNA. Once isolated, the DNA may be placed into expression vectors, which are then transfected into 
host cells such as £. coli cells, simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that 
do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the 
recombinant host cells. Recombinant production of antibodies will be described in more detail below. 

Hybridoma cell lines producing antibodies are identified by screening the culture sup^rnatants for 
antibody which binds to HER2/HER3 receptors. This is routinely accomplished by conventional 
immunoassays , using soluble receptor preparations or by FACS using cell-bound receptor and labeled 
candidate antibody. Agonist antibodies are preferably antibodies which stimulate autophosphorylation in the 
HRG tyrosine autophosphorylation assay described above. 

The hybrid cell lines can be maintained in culture in vitro in cell culture media. The cell lines of 
this inventior can be selected and/or maintained in a composition comprising the continuous cell line in 
hypoxanthine-aminopterin thymidine (HAT) medium. In fact, once the hybridoma cell line is established, it 
can be maintained on a variety of nutritionally adequate media. Moreover, the hybrid cell lines can be stored 
and preserved in any number of conventional ways, including freezing and storage under liquid nitrogen. 
Frozen cell lines can be revived and cultured indefinitely with resumed synthesis and secretion of 
monoclonal antibody. The secreted antibody is recovered from tissue culture supernatant by conventional 
methods such as precipitation, ion exchange chromatography, affinity chromatography, or the like. The 
antibodies described herein are also recovered from hybridoma cell cultures by conventional methods for 
purification of IgG or IgM as the case may be that heretofore have been used to purify these 
immunoglobulins from pooled plasma, e.g.. ethanol or polyethylene glycol precipitation procedures. 

Human antibodies may be used and are preferable. Such antibodies can be obtained by using 
human hybridomas (Cote et aL, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985)). 
Chimeric antibodies, Cabilly et aL U.S. 4,816,567, (Morrison et aL, Proc. Natl. Acad. Set., 81:6851 (1984); 
Neuberger et aL, Nature 312:604 (1984); Takeda et aL, Nature 314:452 (1985)) containing a murine anti- 
HER2/HER3 variable region and a human constant region of appropriate biological activity (such as ability 
to activate human complement and mediate ADCC) are within the scope of this invention, as are humanized 
antibodies produced by conventional CDR-grafting methods(Riechmann et aL, Nature 332:333-327(1988); 
EP 0328404 A 1 ; EP 02394000 A2). 

Techniques for creating recombinant DNA versions of the antigen-binding regions of antibody 
molecules (Fab or variable regions fragments) which bypass the generation of monoclonal antibodies are also 
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encompassed within the practice of this invention. One extracts antibody-specific messenger RNA 
molecules from immune system ceils taken from an immunized subject, transcribes these into 
complementary DNA (cDNA), and clones the cDNA into a bacterial expression system and selects for the 
desired binding characteristic. The Scripps/Stratagene method uses a bacteriophage lambda vector system 
containing a leader sequence that causes the expressed Fab protein to migrate to the periplasmic space 
(between the bacterial cell membrane and the cell wall) or to be secreted. One can rapidly generate and 
screen great numbers of functional Fab fragments to identify those which bind the receptors with the desired 
characteristics. Alternatively, the antibodies can be prepared by the phage display techniques described in 
Hoogenboom, Tibtech February 1997 (vol 15); Neri et aU Cell Biophysics 27:47-61 (1995); Winter et al. y 
Annu. Re\>. Immunol. 12:433-55 (1994); and Soderlind et ai. Immunol. Rev, 130:109-124 (1992) and the 
references described therein as well as the monovalent phage display technique described in Lowman et al. y 
Biochem. 30: 1 0832- 1 0838 ( 1 99 1 ). 

2. Therapeutic compositions, administration and use of Heregnlins 

and Agonist Antibodies 

The HRG are used in the present invention to induce epithelial cell growth, for example lung 
epithelial cell growth, proliferation and differentiation , and to increase the production of surfactant protein 
A by lung cells. These effects allow treatment of disease states associated with tissue damage, for example, 
chronic obstructive pulmonary disease(COPD) including subtypes thereof such as chronic bronchitis, 
emphysema, asthma, etc., neonatal pulmonary diseases including neonatal respiratory distress syndrome, 
meconium aspiration syndrome, chronic lung disease of the neonate, congenital diaphragmatic hernia, etc., 
acute lung injuries including smoke or chemical inhalation, pneumonitis due to aspiration, radiation, etc., 
near drowning, cystic fibrosis and other epithelial cell trauma diseases, including injuries associated with 
surgical wounds and resections, ulcers, lesions, and tissue tears, with the method of the invention. 

A preferred indication for treatment with the method of the invention is the treatment of COPD. 
COPD is a spectrum of chronic inflammatory respiratory diseases characterized by cough, sputum, dyspnea, 
airflow limitation and impaired gas exchange. COPD is common in older populations and presents a pattern 
of gradually declining lung function. Typically, a patient will exhibit a chronic cough with clear sputum 
whi"h worsens to a cough with thick sputum and accompanying poor air exchange. These conditions 
frequently lead to heart disease and death. Many persons with COPD will have chronic bronchitis together 
with emphysema. The present invention is particularly important because it halts, slows and/or reverses the 
lung destruction process in COPD patients. In this action, the method of the invention is very different from 
typical treatments for COPD in which the symptoms are treated, but not the underlying destruction of lung 
cell tissue and function. 

The method of the invention may, however, be combined with or administered together with other 
therapies for treatment of lung disease such as COPD. For example, the method of this invention can be 
used together with the administration of an anticholinergic bronchodilator such as ipratropium bromide 
(ATROVENT available from Boehringer Ingleheim) or tiotropium, a beta adrenergic receptor agonist such 
as albuterol (PROVENTIL available from Schering) or salmeterol, steroids such as prednisone, retinoic acid, 
phosphodiesterase inhibitors, endothelin antagonists, metalloproteinase inhibitors, elastase inhibitors, free 
radical inhibitors, serine proteinase inhibitors, neutrophil elastase inhibitors, pulmonary surfactant 
compositions such as beractant (SURVANTA available from Ross Labs.) , PDGF, FGF, EGF, growth 
hormone or other protein growth factors, etc. or combinations thereof. The relative amount of the HRG and 
the additional compound(s) can be readily determined by a physician with regard to the individual 
symptoms of the patient. It is anticipated that these compositions and the relative amounts of the 
components therein will be varied as necessary to address the specific needs of a patient and will be 
monitored and adjusted using conventional physiochemical and medical tests for lung function. 
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Therapeutic formulations of HRG or agonist antibody are prepared for storage by mixing the HRG 
protein having the desired degree of purity with optional physiologically acceptable carriers, excipients, or 
stabilizers (Remington's Pharmaceutical Sciences, supra), in the form of lyophilized cake or aqueous 
solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages and 
concentrations employed, and include buffers such as phosphate, citrate, and other organic acids; 
antioxidants including ascorbic acid: low molecular weight (less than about 10 residues) polypeptides; 
proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic polymers such as 
polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine or lysine; 
monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrins; chelating 
agents such as EDTA; sugar alcohols such as maruiitol or sorbitol; salt-forming counterions such as sodium; 
and/or nonionic surfactants such as TWEEN, PLURONICS or polyethylene glycol (PEG). 

HRG or agonist antibody to be used for in vivo administration must be sterile. This is readily 
accomplished by filtration through sterile filtration membranes, prior to or following lyophilization and 
reconstirutioh. The HRG or antibody ordinarily will be stored in lyophilized form or in solution. 

Therapeutic HRG or antibody compositions generally are placed into a container having a sterile 
access port, for example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic 
injection needle. 

The route of HRG or antibody administration is in accord with known methods, e.g., injection or 
infusion by intravenous, intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial, powder or 
liquid aerosol administration to the nose or lung or intralesional routes, or by sustained release systems as 
noted below. The HRG ligand may be administered continuously by infusion or by bolus injection. An 
agonist antibody is preferably administered in the same fashion, or by administration into the blood stream or 
lymph. 

The HRG, HRG variant or fragment and agonist antibodies may be spray dried or spray freeze dried 
using known techniques (Yeo et al, Biotech, and Bioeng., 41:341-346 (1993); Gombotz et al, 
PCT/US90/02421). 

Suitable examples of sustained-release preparations include semipermeable matrices of solid 
hydrophobic polymers containing the protein, which matrices are in the form of shaped articles, e.g. films, or 
microcapsules. Examples of sustained-release matrices include polyesters, hydrogels (e.g., poly(2- 
hydroxyethyl-methacrylate) as described by Langer et al.. J. Biomed. Mater. Res.. 15: 167-277 (1981) and 
Langer, Chem. Tech., 12: 98-105 (1982) or poly(vinyialcohol)), polyiaciides (U.S. 3.773,919, EP 58.481), 
copolymers of L-glutamic acid and gamma ethyl-L-glutamate (Sidman et al, Biopolymers. 22: 547-556 
(1983)), non-degradable ethylene-vinyl acetate (Langer et al.. supra), degradable lactic acid-glycolic acid 
copolymers such as the LUPRON DEPOT (injectable microspheres composed of lactic acid-glycolic acid 
copolymer and leuprolide acetate), and poly-D-(-)-3-hydroxyburyric acid (EP 133,988). While polymers 
such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release of molecules for over 100 days, 
certain hydrogels release proteins for shorter time periods. When encapsulated proteins remain in the body 
for a long time, they may denature or aggregate as a result of exposure to moisture at 37°C, resulting in a loss 
of biological activity and possible changes in immunogenicity. Rational strategies can be devised for protein 
stabilization depending on the mechanism involved. For example, if the aggregation mechanism is 
discovered to be intermolecuiar S-S bond formation through thio-disulfide interchange, stabilization may be 
achieved by modifying sulfhydryl residues, lyophilizing from acidic solutions, controlling moisture content, 
using appropriate additives, and developing specific polymer matrix compositions. 

Sustained -re lease HRG or antibody compositions also include liposomally entrapped HRG or 
antibody. Liposomes containing HRG or antibody are prepared by methods known per se: DE 3,218,121; 
Epstein et al.. Proc. Natl. Acad Set. USA. 82: 3688-3692 (1985); Hwang et at., Proc. Natl. Acad ScL USA. 
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77: 4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142.641; Japanese patent 
application 83-1 18008; U.S. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily the liposomes are of the 
small (about 200-800 Angstroms) unilamelar type in which the lipid content is greater than about 30 mol. % 
cholesterol, the selected proportion being adjusted for the optimal HRG therapy. Liposomes with enhanced 
5 circulation time are disclosed in U.S. patent 5,013,556. 

An effective amount of HRG or antibody to be employed therapeutically will depend, for example, 
upon the therapeutic objectives, the route of administration, and the condition of the patient. Also, the 
amount of HRG polypeptide will generally be less than the amount of an agonist antibody. Accordingly, it 
will be necessary for the therapist to titer the dosage and modify the route of administration as required to 

10 obtain the optimal therapeutic effect. A typical daily dosage might range from about 1 pg/kg to about I 
mg/kg and up to 100 mg/kg or more, depending on the factors mentioned above. Typically, the clinician will 
administer HRG or antibody until a dosage is reached that achieves the desired effect. The progress of this 
therapy is easily monitored by conventional assays, for example, surfactant protein A production. 

In a further embodiment, epithelial cells may be obtained or isolated from a mammalian tissue to 

!5 obtain a normal epithelial ceil sample using techniques well known in the art (biopsy, etc.). This sample may 
then be treated with a heregulin protein in order to induce epithelial cell growth and/or proliferation in the 
sample thereby expanding the population of primary epithelial cells. Typically, heregulin will be added to 
the in vitro epithelial cell culture at a concentration of about 0.1 to about 100 nM preferably 1-50 nM. If 
desired, the primary epithelial cells may be cultured in vitro for several generations in order to sufficiently 

20 expand the epithelial cell population. The epithelial cells are cultured under conditions suitable for 
mammalian cell culture as discussed above. After expansion, the expanded sample is reintroduced into the 
mammal for the purpose of re-epithelializing the mammalian tissue. For example, lung epithelial cells 
isolated from a patient having emphysema or chronic obstructive pulmonary disease may be obtained, 
expanded and reintroduced into the lung in order to more quickly re-epithelial ize the damaged lung tissue 

25 thereby reestablishing lung function. The expanded cells may be reintroduced into the lung by aspiration or 
intubation using methods well known in the an. 

The methods and procedures described herein with respect to HRG-oc or HRG in general may be 
applied similarly to other HRG such as HRG-Pl, HRG-P2 and HRG-P3 and to variants thereof, as well as to 
the antibodies. The following examples are offered by way of illustration and not by way of limitation. 

30 EXAMPLES 

Example I - Preparation of Heregulins 

(a) Heregulins HRG-a, HRG-pl, HRG-p2, HRG-p2-like, and HRG-p3 were isolated, cloned, 
expressed and isolated from the cell culture medium as described in U.S. 5,367,060. 

(b) SMDF polypeptides are prepared as described in WO 96/1 5244. 

35 (c) y-HRG polypeptide was prepared and characterized as described below. 

Reagents: The EGF-like domain of HRGpi^ 177-244) was expressed in £. coli, purified and 

radioiodinated as described previously (Sliwkowski et al. J. Biol. Chem. 269:14661-14665 (1994)). The 
anti-HER2 monoclonal antibodies 2C4 and 4D5 have been described elsewhere (Fendly et aL Cancer 
Research 50:1550-1558(19 90)). 

40 HER3 and HER4-immunoadhesins: A unique Ml I site was engineered into a plasmid expressing 

human IgG heavy chain at the region encoding the hinge domain of the immunoglobulin. Ml I sites were 
also engineered into a set of HER expression plasmids at the region encoding the ECD/TM junctions of these 
receptors. All mutageneses were done using the Kunkel method (Kunkel. T., Proc. Natl. Acad. Set. U.S.A. 
82:488 (1985)) The Ml I sites were utilized to make the appropriate HER-IgG fusion constructs. The fusion 

45 junctions of the various HER-IgG chimeras were: for HER2, E 646 (lER 2-(TR)-DKTH 224 VH ; for HER3, 
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L 656 HER3 -(TR)-DKTH 224 VH ; for HER4, G 64,0 HER 4-(TR).DKTH 224 Vi| . The conserved TR sequence is 
derived from the Ml I site. The final expression constructs were in a pRK-type plasmid backbone wherein 
eukaryotic expression is driven by a CMV promoter (Gorman et al., DNA Prof. Eng. Tech. 2:3-10 (1990)). 

To obtain protein for in vitro experiments, adherent HEK-293 cells were transfected with the 
appropriate expression plasmids using standard calcium phosphate methods (Gorman et al., supra and Huang 
et ai. Nucleic Acids Res. 18:937-947 (1990)). Serum-containing media was replaced with serum-free media 
15 hours post-transfection and the transfected cells incubated for 5-7 days The resulting conditioned media 
was harvested and passed through Protein A columns (1 mL Pharmacia HiTrap™). Purified IgG fusions 
were eluied with 0.1 M citric acid (pH 4.2) into tubes containing I M Tris pH 9.0. The eluted proteins were 
subsequently dialyzed against PBS and concentrated using Centri-prep-30 filters (Amicon). Glycerol was 
added to a final concentration of 25% and the material stored at -20 C Concentrations of material were 
determined via a Fc-ELISA. 

Cell Culture: Human breast cancer cell lines MDA-MB-I75, MDA-MB-23 1, SK-BR-3 and MCF7 
were obtained from the American Type Culture Collection and maintained in a 50:50 mixture of F12 Ham's 
and Dulbecco's modified Eagle medium (DM EM), supplemented with 10% heat inactivated FBS, 2 mM 
glutamine and 10% penicillin-streptomycin. 

Generation and Characterization of cDNA Library: Total RNA was purified from MDA-MB-175 
cells using the guanidinium isothiocyanate-cesium chloride procedure (Sambrook et al. Molecular Cloning; 
t laboratory Manual (New York: Cold Spring Harbor Laboratory Press, (1989)). Poly (A)' RNA was 
ivoljtcd using oligo (dT) Dynabeads (DYNAL) as recommended by the supplier. First and second strand 
syntheses were performed using a Gibco BRL cDNA s: . thesis kit. XgtIO cDNA recombinants were 
-eneraicd when a cDNA cloning system from Amersham was used. In vitro packaging was performed using 
tnuapack II packaging extract (Stratagene). Pstl-Xhol HRGP3 cDNA fragment (nt 144-618) was labeled by 
random priming and 1 x 10 6 plaques were screened. Positive clones were confirmed and purified by 
secondary and tertiary screening. Phage DNA was isolated as a BamHI fragment and subcioned into the 
corresponding site of pBluescript SK' . Clone 5 was completely sequenced using the Sequenase version 2.0 
DNA sequencing kit (United States Biochemicals. Inc.). Both strands were sequenced. 

Bacterial Expression System: A cDNA fragment of clone 5 (nt 1690- 2722) was subcioned into the 
pET-32 TRX fusion vector (Novagen). This BglH-BgUI fragment was inserted into the BamHI site of the 
pET32a plasmid. The trxy-HRG (amino acids 455-768) protein expression in E.coli was induced as 
recommended by the supplier. 

Purification of Recombinant y-HRG: E.coli cells expressing trxy-HRG were collected and 
suspended at 9 ml/g in 50 mM Tris HCL pH 8. Lysozyme was added to a final concentration of 0.2 mg/ml 
and the solution was stirred on ice for 1 hr. Dnase 1 (10 u.g/ml) and MgCI : (4 mM) were added. The 
solution was then sonicated for 30 min and cell pellets collected afterwards. The pellet fraction was 
dissolved at 250 ml/g in 6 M Gdn HCL. 0.1 M Tris HCL, pH 8.8. Solubiiized proteins were sulfitolyzed by 
adding 1/10 volume of 1 M Na 2 S0 3 and 1/10 volume of 0.2 M Na 2 S 4 0 6 . The reaction was allowed to 
proceed for 1.5 hours at room temperature and protein was purified by gel filtration chromatography using a 
High Load Superdex™ 75 prep grade column (Pharmacia). Refolding was initiated by the addition of 1 mM 
cysteine, and 10 mM methionine was added as an antioxidant and incubated overnight at room temperature. 
Protein concentration was determined by quantitative amino acid analysis. 

Northern and Southern Hybridization: Total RNA was isolated by the method of Chomczynski et 
al. Anal Biochem. 162:156-159 (1987). Poly (A) + was isolated using oligo d(T) cellulose columns (Qiagen) 
as recommended by the supplier. RNA was denatured and size fractioned in a 0.8% formaldehyde/ 1% 
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agarose gel and transferred onto nylon membrane (Hybond. Amersham). RNA was UV crosslinked (UV 
Stratalinker. Stratagene). Prehybridization was carried out at 42 C in 50 %formamide/ 1% SDS/ I M NaCl, 
10 % dextransulfate and 100 u.g/ml herring sperm DNA for at least 2 hours. cDNA probes using either a 
restriction fragment with complementary sequence to the EGF-Iike domain of HRG03 or a Kpnl-Avall 
cDNA fragment encoding the unique sequence of y-HRG (nt 1 238- 1 868) were radiolabeled by random 
priming (Prime-It II, Stratagene). Hybridiziation was done in equal solution at 42°C containing the 32 P 
labeled fragments for 16 nr. Blots were washed several times with 2 x SSC/ 1% SDS at room temperature; 
washed with the same solution at 65°C for 20 min and finally washed with 0.2 x SSC/ 0.1% SDS at room 
temperature for 1 5 min. The blots were air dried and exposed to Du Pont Reflection™ film with intensifying 
screens at -80°C for 7-40 hours. Human multiple tissue Northern blots (Clontech) containing 2 u.g poly (A)* 
from spleen, thymus, prostate, testis, ovary, small intestine, colon, peripheral blood leukocytes, heart, brain, 
placenta, lung, liver, skeletal muscle, kidney and pancreas were hybridized with a radiolabeled y-HRG 
cDNA probe (nt 841-1447) as recommended by the supplier. 

MDA-MB-175 and MDA-MB-23I genomic DNA was isolated as described in Sambrook et al % 
supra. DNA was digested with different restriction enzymes, prior to transfer treated with 0.25 N HCI and 
transferred onto nylon membrane (Hybond. Amersham). Bglll-Ndel cDNA fragment of y-HRG (nt 1690- 
2351) was also radiolabeled by random priming and used as a hybridization probe- Prehybridization was 
carried out in 6 x SSC/ 5 x DenhardtV 0.75% SDS, 10% Dextransulfate and 100 u.g/ml herring sperm DNA 
at 68 C for 4 hours and hybridization with radiolabeled probe was done overnight. The same wash conditions 
as for Northern blots were used except a wash step with 0.2 x SSC/ 0.1% SDS at 68°C was added and 
detection was pursued as described above. 

!2:> I-HRG Binding Assay: Binding assays were performed in Nunc breakapart strip wells. Plates 

were coated at 4°C overnight with 100 u,l of 5 u.g/mi goat-anti-human antibody (Boehringer Mannheim) in 
50 mM carbonate buffer (pH 9.6). Plates were rinsed twice with wash buffer (PBS/ 0.05% Tween-20 ) and 
blocked with 100 u.1 1% BSA/ PBS for 30 min. Buffei *as removed and each well was incubated with 15 ng 
IgG fusion protein in 1% BSA/ PBS under vigorous shaking for 1.5 hours. Plates were rinsed three times 
with wash buffer and competitive binding was carried out by adding various amounts of y-HRG and Ifc5 I- 
HRGpl under vigorous shaking. After incubation for 1.5-2 hours, wells were rinsed three limes with wash 
buffer, drained and individual wells were counted using a 100 Series Iso Data y-counter. 

Tyrosine Phosphorylation Assay: MCF7 ceils were plated in 24 well plates at 1 x I0 5 cells/ well in 
FI2/DMEM containing 10% FBS. After 48 hours, cells were washed with serum free F12/DMEM and 
serum starved for 6 hours. Various concentrations of bacterial expressed truncated y-HRG (i.e., 0 pM, 22 
pM, 66 pM, 200 pM and 600 pM trxy-HRG) or unpurified conditioned medium of MDA-MB-175 cells were 
prepared in binding buffer (0.1% BSA in F12/DMEM) and added to each well. After 8 min incubation at 
room temperature, media was carefully aspirated and reactions were stopped by adding 100 u.1 of sample 
buffer (5% SDS, 0.25% 2-mercaptoethanol, 25 mM Tris-HCL pH 6.8). 20 u,l of each sample was size 
fractionated in a 4-12% gradient gel (Novex) and then electrophoreticaily transferred onto nitrocellulose 
membrane. Antiphosphoryrosine (4G10, from UBI, used at I u.g/ml) immunoblots were developed and the 
predominant reactive band at M r -180 kDa was quantified by reflectance densitometry. 

Production and characterization of conditioned medium from MDA-MB-175 cells, Cells were 
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seeded in T175 flasks and grown until reaching 70-80% confluency ( - 2.5 x 10 cells/ flask). Subsequently, 
cells were washed with PBS and grown in serum free F12/ DMEM medium for 3-4 days. Medium was then 
collected, filtered and concentrated using an ultrafiltration cell with YM10 Diafio ultafiltration membranes 
(Amicon). y-HRG was visualized in conditioned medium of MDA-MB-175 cells by Western blot analysis 
5 under non reducing conditions. y-HRG was partially purified by HPLC using a C4 reverse phase column. 
CHO expressed full length HRGpi (lane 1) and semi pure y-HRG (lane 2) were electrophoresed, blot was 
probed with HER2/HER4 IgG heterodimers and Western blot was developed. A - 64 kDa band could be 
seen in the lane containing partial purified supernatant whereas CHO expressed full length HRGPl migrated 
as a 45 kDa protein. 

10 Cell Proliferation Assay with Crystal Violet: Tumor cell lines were plated in 96 well plates at 

following densities: 2 x 10 4 cells/well for MDA-MB-175 and 1 x 10 4 cells/well for SK-BR-3. The media 
contained 1% FBS and cells were allowed to adhere for 2 hours. Monoclonal antibodies, immunoadhesions 
(10 |ig/ml) or media alone were added and the cells were incubated for 2 hours at 37°C. rHRGpi 177.244 was 
added at a final concentration of ! nM, or 100 nM for neutralising the irnmunoadhesion, and the cells were 

15 incubated for 4 days. Monolayers were washed with PBS and stained/fixed with 0.5% crystal violet. Plates 
were air dried, the dye was eluted with 0. 1 M sodium citrate (pH 4.2) in ethanol (50:50) and the absorbance 
was measured at 540 nm. 

Isolation and sequence analysis of y-HRG: To characterize the heregulin transcript in MDA-MB- 
175 cells, a *gt 10 cDNA library was constructed with mRNA derived from this ceil line. The library was 

20 screened with a cDNA probe corresponding to the EGF-like domain and pan of the N-terminal sequence of 
HRGp3. Various clones were identified. One of the clones which appeared to contain the full length cDNA 
sequence was isolated and sequenced. Fig.7A-7D shows the nucleotide sequence and the predicted amino 
acid sequence of y-HRG. The single open reading frame of 2303 bp starts with an ATG codon at nt 334. 
This start codon lies in a nucleotide sequence context, which is known to be a potential translation initiation 

25 site (Kozak. Nucleic Acid Research 15:8123-8148 (1987)). Several termination codons were found upstream 
of the initiation codon. The stop codon TAG at nt 2637 is followed by the 3' noncoding sequence, which is 
identical to other HRG isoform sequences and includes a polyadenyiation signal followed by an A-rich 
region. The open reading frame encodes a protein of 768 amino acid residues with a calculated molecular 
mass of 84.2 kDa. 

30 (d) The selection of HRG-Pl variants containing residues corresponding to the minimal EGF-like 

domain (HRG-Pl 177-228) was conducted using monovalent phage display. For these variants, residue 
numbers also are expressed, in parentheses, in terms of the position of the residue in the minimal EGF-like 
domain (i.e., HRG-pl EGF 1-52). 

Variants of HRG-Pl EGF were prepared and selected for binding to HER-3-lg using monovalent 

35 phage display, according to the method of Bass et ai. Proteins 8:309-314 (1990). As discussed in detail 
below, an HRG-Pl EGF phagemid vector was prepared, in which HRG-pl EGF was fused to a C-terminal 
fragment of the M 13 coat protein pill. Kunkel mutagenesis was performed to introduce stop codons into this 
vector at sites selected for randomization. This step ensures that the starting vector is incapable of 
expressing the wild-type polypeptide. Stretches of four to six residues per library were randomized in a 

40 linear fashion, except for the six cysteines, Phel89 (HRG-pi EGF Phel3) and the two most C-terminal 
residues. Phel 89 was not altered because this residue is conserved as an aromatic residue in EGF and TGF-a 
and forms a stacking interaction with Tyr208 (HRG-pl EGF Tyr32) Jacobsen et al.. Biochemistry 35:3402- 
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17 (1996). HRG-Pl EGF was thus covered in eight libraries, designated A-E. G. H and I. 

Library E. covering HRG-Pl 202-209 (HRG-Pl EGF 26-33), contained a three-residue deletion. 
The deleted region corresponds to a disordered turn between the second and third P-sheet of HRG-pi EGF, 
and the equivalent amino acids are absent in EGF and TGF-cc. An HRG-pl EGF control variant in which 
5 HRG-Pl 202-204 (HRG-pl EGF 26-28) of HRG8 are deleted (HRG63) bound HER-3-Ig with an affinity 
similar to that of wild-type. 

An additional library (F) was created to randomize a surface patch composed of side chains from 

the first and second p-sheets, which included HRG-pl 178. 180, 198, and 200 (HRG-pi EGF 2, 4, 22, and 
24). 

10 The selected sites in the starting vectors were randomized by Kunkel mutagenesis to produce HRG- 

Pl EGF libraries. Phage displaying mutated HRG-Pl EGFs were produced from the libraries under 
conditions such that, statistically; each phage particle displayed no more than one copy of the mutated HRG- 
Pl EGF. See Bass ei a/., supra. These phage were then selected for binding to (sorted against) HER-3-Ig 
immobilized on an ELISA plate. Bound phage were eluted and used to reinfect host cells, which were used 

! - 10 produce new phage for another round of sorting. This process was repeated six to seven times for each 
library. Twelve clones from the phage selected from each library were then sequenced. 

Table 2: Library A Variants 









Position in HRG-pl 








Variant No. 


177 


178 


179 


180 


181 






Wild-type 
1 


S 


H 


L 


V 


K 






W 


R 






P 






-> 


w 


S 




Q 


P 






3, 5, 10 


w 


E 






P 






4 


w 


S 










25 


6 


w 


S 




i 


P 






7 


w 


R 






A 






8 


w 


A 






P 






9 


w 


S 




Q 








I 1 


w 


E 






A 




30 


12 


w 


S 




E 


P 










Tabie 3: 


Library B Variants 












Position in HRG-pl 








Variant No. 


183 


184 


185 


186 


187 


188 


35 


Wild-type 


A 


E 


K 


E 


K 


T 




1* 


G 


V 


G 


R 


D 


G 




2* 


G 


G 


E 


R 


E 


G 






G 




E 


R 


E 


G 




4*, 5* 


G 


w 


D 


R 


E 


G 


40 


6* 


G 


V 


0 


R 


E 


G 




7 


G 




E 


R 


A 


G 




8 


G 


k 


E 


R 


E 


G 




9* 


T 


N 


S 


R 


E 


G 




10* 


D 


K 


S 


R 


E 


G 


45 


11* 


G 




D 


R 




O 




12 


G 


R 


E 


R 


E 


G 



•Variant also contained Met226Ile. 
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Table 4: Library C Variants 








Position in HRG-Pl 




Variant No. 


191 


192 193 


194 


195 


Wild-type 


V 


N G 


G 


E 


I "i A ^ 7 1 *> 

1, Z« H, J, /- 1 Z. 










3 








V 


6 








0 






Table 5: Library D Variants 








Position in HRG-pl 




Variant No. 


197 


198 199 


200 


201 


Wild-type 


F 


M V 


K 


D 


IV 2*, 8*, 12* 


Y 


K 


R 


1 


3 




R 




T 


4,5,7,9 


Y 


R 




T 


6 


Y 






Y 


10 


Y 






T 


1 1 


M 


R 


R 


T 


♦Variant also contained Met226Ilc. 







25 



30 



35 



40 



45 



50 



55 



Table 6: Library E Variants 
Position in HRG-Ql 



Variant Jc. 


205 


206 


207 


208 


209 


Wild-type 
1 


P 


S 


R 


Y 


L 


T 


P 


Y 


L 


M 


2, 4 . 


Y 


G 


Y 


L 


M 


3* 


Y 


R 


Y 


R 


M 


5. 12 


T 


H 


Y 


R 


G 


6 


T 


H 


Y 


R 


M 


7* 


Y 


K 


Y 


R 


M 


8,9 


T 


K 


Y 


R 


G 


10 


Y 


K 


Y 


R 





* Variant also contained Met226Ile. 



Variant No, 
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Table 7: Library G Variants 

Position in HRG-Pl 

212 213 214 215 



Wild-type 
1,5,6, 10, 12 
2 

3 
4 

7, 8 
9 
I 1 



K 
R 
R 

R 
R 



P 

S 
S 

T 
T 
N 
K 



N 
L 
E 
K 
V 
V 

s 

K 



216 



M 
Y 
Y 



Example 2 - HER2/HER3 expression in embryonic rat lung 

Rat lungs were microdissected from rat embryos on embryologic day (E) 16, 18, 20, and post-natal 
(P) days 7, 14, and adult. Isolated lung tissue was homogenized in a standard protease inhibitor buffer, and 
equal protein amounts subjected to SDS-PAGE (4-20%), blotted to nitrocellulose and identified with specific . f 
antibodies (HER2, HER3, HER4-Santa Cruz Biological, San Jose, CA. and HRG, 3G1 1, Genentech, Inc.) 
using chemiluminescent techniques. 

Analysis of the blot indicates that HER2 is expressed at high levels throughout development in 
utero. appears to peak on El 8 and declines after birth to lower adult levels. HER3 expression peaks at El 8, 
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and then declines after birth to lower adult levels. HER4 is not identified at any time during lung 
development. A proform of HRG may be present (75 kDa protein) on EI 6. peaking on El 8 and then 
declining to low levels in the adult. These data suggest that the HER/HRG system is developmental^ 
modulated during lung development and the active receptor may be a HER2/HER3 heterodimer. The time 
5 period during which the receptors peak in expression (EI 8) represents the pseudoglandular stage of rat lung 
development (days 13-18), bordering on the canalicular stage (days 19-20). During the pseudoglandular 
stage, pulmonary epithelial ceil proliferation is higher than at any other time. During the canalicular stage, 
differentiation begins with the appearance of type 1 pneymocyte and type II (surfactant producing) 
pneymocytes. This indicates a role for HRG/HER interaction in either or both processes. 

10 Example 3 - HER2/HER3 expression in fetal human lung 

Fetal human lung was obtained from mid trimester embryos (17-22 weeks). Lung tissue was 
cultured in serum free Weymouth's media at an air fluid interface at 37°C in a humidified 5% COo 
atmosphere. Tissue was harvested from the culture on days 0 (day tissue was received) and after 1-4 days 
(D) in culture (D1-D4), homogenized in a standard protease inhibitor buffer equal protein amounts subjected 

1 5 to SDS-PAGE (4-20%), blotted to nitrocellulose and identified with specific antibodies. 

HER2 is expressed on DO and increases in expression level during in vitro culture. HER3 is present 
at low to undetectable levels at DO and increases in expression level during in vitro culture. HER4 was not 
identified at any time during lung development. HRG was not identified on DO, however, it was identified as 
a 75,000 Da protein on D1-D2 and continued to rise in expression level throughout time in culture. This 

20 human explant model recapitulates part of the normal lung developmental program over the 5 days in 
culture. Development occurs rapidly with both epithelial cell proliferation, and differentiation occurring, 
along with air space formation. These data indicate that the HER/HRG system is also modulated during in 
vitro human lung development, and the active receptor may be a HER2/HER3 heterodimer. The third 
trimester represents late pseudoglandular (days 42-1 12) and canalicular stage (days 1 12-196) in human lung 

25 development. As in the rat, during the pseudoglandular stage epithelial cell proliferation and formation of 
the prospective airways occurs. During the canalicular stage differentiation of the epithelium occurs. 
Example 4 - Expression of HER2 and HER3 in human lung 

HER2 and HER3 are expressed exclusively in the pulmonary epithelium during lung devtiupment. 
Mid-trimester human lung was cultured in vitro as outlined above. Tissue was harvested daily, snap frozen, 

30 and 5 micron sections cut. The sections were mounted on glass slides, and immunohistochemistry performed 
using standard ABC procedures. HER2, HER3, and HER4 were identified using specific antibodies as 
described above. 

As a control, lung tissue was stained at DO and D5 with an irrelevant antibody and no staining was 

detected. At DO, the lung is relatively unformed. The majority of the tissue is mesenchymal cells. Early air 
35 spaces are being developed. On D5, air spaces are clearly identifiable, with thinning of the mesenchymal 

tissue and proliferation of the lung epithelium required to cover the enlarging air spaces. 

Using HER2 staining at DO and D5, it was established that HER 2 is uniformly present on DO lung 

tissue throughout the lung epithelial tissue. No expression was present in the mesenchymal tissue. By D5, 

HER2 remains uniform and restricted to the pulmonary epithelium. 
40 Using HER3 staining control at DO and D5, HER3 was identifiable in DO lung tissue. The staining 

was relatively less than HER2. and was not homogenous, suggesting that there are specific epithelial areas 

expressing the HER3 receptor. By D5, expression had become more homogenous throughout the lung 

epithelium, but clearly not uniform. Expression remained epithelium specific. 

Example 5 - Preparation of rHRGPl j 77.244 
45 The EGF-like domain fragment HRG-pl 177-244 was amplified from vector pHL89 (which is 

described in Holmes et a!.. Science 256:1205-1210 (1992)) by PCR with primers having Nsil/Xbal- 
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containing overhangs. The fragment was inserted into phagemid display vector pam-g3 by restriction digest- 
ligaiion at the same sites to generate construct pHRG2-g3 (177-244). pam-g3 was a derivative of phGHam- 
g3, which was designed for phage display of human growth hormone (hGH) and was described in Lowman 
et aL Biochemistry 30:10832-10838 (1991). pam-g3 was produced by removing the hGH gene present in 
phGHam-g3. and replacing this gene with a stuffer fragment, which provides space for cleavage at the 
restriction sites used for cloning. The HRG-Pl fragment was attached to residue 247 of pi II. 

The HRG-pl EGF-like domain expressed from the above-described construct is designated by 

removing the "p" and the "-g3" that appear in the name of the construct. Thus, the HRG-pl EGF-like 
domain expressed from the pHRG2-g3 construct is designated "HRG2." 

The domain was displayed monovalently on phage as a pi 1 1 fusion protein, as described by Bass et 
aL Proteins 8:309-314 (1990). 

Similarly, variants HRG-Pl 147.22?' HRG-pi 147.244* and HRG-pl 177.227 were prepared and 
expressed as described above. 

Example 6 - rHRGPl j ca °ses accelerated lung development 

Exogenous rHRGpl 177.244 caused accelerated lung development in vitro. To determine if the 
expressed HER2/HER3 receptors were functional and the role of HRG stimulation during lung development 
in vitro, rHRGpl 177.244 w as added to the in vitro culture at 10 nM. Tissue was harvested at D5, snap 

frozen, and 5 micron sections were cut for analysis. 

The morphology, in comparison to the untreated control specimens was grossly different. There 
was marked proliferation of the epithelium. Air spaces which are typically lined with a single cell layer now 
had a 2-3 cell thickness. The changes were dose dependent with more epithelial cell response with higher 
concentrations. HER2 and HER3 were still identifiable in the epithelium only. 
Example 7 - Human lung differentiation 

Differentiation of human lung epithelial cells occurs after HRG treatment. Differentiation was 
measured by Surfactant Protein A (SPA) production. All sections were stained for SPA. Human lung 
explant stained for SPA with stain localizing in epithelial cells of the prealveolar ducts, human lung explant 
exposed to 10 nM HRG showed an effect on SPA production. As a differentiation control, a lung explant 
was exposed to ImM diburyryl cAMP. A negative control was also run. 
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WE CLAIM: 

1. A method of inducing epithelial cell growth and/or proliferation, comprising contacting a normal 
epithelial cell which expresses HER2, HER3 and/or HER4 receptors with an effective amount of an isolated 
ligand which activates HER2, HER3, HER4 receptors or a combination thereof. 

2. The method of Claim 1, wherein the activating ligand is a hereguiin (HRG) polypeptide, HRG 
variant, HRG agonist antibody or fragment thereof capable of binding to the HER2. HER3 and/or HER4 
receptor. 

3. The method of Claim 2. wherein the activating ligand is human HRG or a fragment thereof. 

4. The method of Claim 2, wherein the activating ligand is selected from the group consisting of HRG- 
a, -pi , -P2, -P2-like, and -p3 and fragments thereof. 

5. The method of Claim 2. wherein the activating ligand is y-HRG or a fragment thereof. 

6. The method of Claim 2, wherein the activating ligand is recombinant human HRG or a fragment 
thereof. 

7. The method of Claim 2, wherein the activating ligand is sen:, ry and motor neuron derived factor 
(SMDF) or a fragment thereof. 

8. The method of Claim I, wherein the activating ligand is administered at a daily dose of about I 
ug/kg to 100 mg/kg. 

9. The method of Claim 2, wherein the activating ligand is an agonist antibody. 

1 0. The method of Claim I . wherein the contacting is by administration to a patient in need thereof. 

11. The method of Claim 6, wherein the HRG is rHRG-pi -1 77-244. 

1 2. The method of Claim I, wherein the epithelial cell is a lung cell. 

13. The method of Claim I wherein the epithelial cell expresses HER2/HER3, HER2/HER4, 
HER3/HER4. HER3 or HER4. 

14. A method of increasing lung surfactant protein A. comprising administering to a patient in need 
thereof an effective amount of an isolated HER2, HER3 and/or HER4 activating ligand. 

15. The method of Claim 14, wherein the activating ligand is a hereguiin (HRG) polypeptide, HRG 
variant, HRG agonist antibody or fragment thereof capable of binding to the HER2. HER3 and/or HER4 
receptor. 

16. A method of treating chronic obstructive pulmonary disease, comprising administering to a patient 
in need thereof an effective amount of an isolated HER2, HER3 and/or HER4 activating ligand. 
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17. The method of Claim 16. wherein the activating ligand is a heregulin (HRG) polypeptide, HRG 
variant. HRG agonist antibody or fragment thereof capable of binding to the HER2, HER3 and/or HER4 
receptor. 

18. A method of treating respiratory distress or emphysema, comprising administering to a patient in 
need thereof an effective amount of an isolated HER2, HER3 and/or HER4 activating ligand. 

19. The method of Claim 18, wherein the activating ligand is a heregulin (HRG) polypeptide, HRG 
variant, HRG agonist antibody or fragment thereof capable of binding to the HER2, HER3 and/or HER4 
receptor. 

20. A method, comprising the steps of: 

(a) obtaining a normal epithelial cell sample from a mammal; 

(b) contacting the sample with a ligand which activates HER2, HER3, HER4 or a combination 
thereof to induce growth and/or proliferation of epithelial cells in the sample and to obtain an expanded 
sample; and 

(c) re-introducing the expanded sample into the mammal. 
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GG GCG CGA GCG CCT CAG CGC GGC CGC TCG CTC TCC CCC 38 
Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro 
1 5 10 

TCG AGG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT 77 
Ser Arg Asp Lys Leu Phe Pro Asn Pro He Arg Ala Leu 
15 20 25 

GGA CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG 116 
Gly Pro Asn Ser Pro Ala Pro Arg Ala Val Arg val Glu 

30 35 

CGC TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC 155 
Aro Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly 
40 45 50 

AGA GG" AAA GGG AAG GGC AAG AAG AAG GAG CGA GGC TCC 194 
Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser 
55 60 

GGC AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA 233 
Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro 
65 70 75 

GCC TTG CCT CCC CGA TTG AAA GAG ATG AAA AGC CAG GAA 272 
Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 *5 SO 

TCG GCT GCA GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC 311 
Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr 

95 100 

AGT TCT GAA TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG 350 
Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys 
105 HO ll 5 

AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT 389 
Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn 
120 125 

ATC AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC 428 
He Lvs He Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
130 135 140 

ATT AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG 467 
He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
145 ISO 155 

TGC AAA GTG ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT 506 
Cys Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser 

160 165 
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GCC AAT ATC ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT 545 
Ala Asn lie Thr He Val Glu Ser Asn Glu He He Thr 
170 175 180 

GGT ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA 584 
Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
185 190 

GAG TCT CCC ^ ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA 623 
Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala 
195 200 205 

AAT ACT TCT TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA 662 
Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 
210 215 220 

AGC CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC 701 
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe 

225 230 

TGT GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT 740 
Cys val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu 
235 240 245 

TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC CAA CCT GGA 779 
Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Gin Pro Gly 
250 255 

TTC ACT GGA GCA AGA TGT ACT GAG AAT GTG CCC ATG AAA 818 
Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
260 265 270 

GTC CAA AAC CAA GAA AAG GCG GAG GAG CTG TAC CAG AAG 857 
Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
275 280 285 

AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC GCC CTC CTT 896 
Arg Val Leu Thr He Thr Gly He Cys He Ala Leu Leu 

290 295 

GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC TGC AAA ACC 935 
Val Val Gly He Met Cys Val Val Ala Tyr cys Lys Thr 
300 305 310 

AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT CTT CGG CAG 974 
Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
315 320 

AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG AAC ATT GCC 1013 
Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn He Ala 
325 330 335 
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AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC GAG AAT GTC 1052 
Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val 
340 345 350 

CAG CTG GTG AAT CAA TAG GTA TCT AAA AAC GTC ATC TCC 1091 
Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val lie Ser 

355 360 

AGT GAG CAT ATT GTT GAG AGA GAA GCA GAG ACA TCC TTT 1130 
Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe 
365 370 375 

TCC ACC AGT CAC TAT ACT TCC ACA GCC CAT CAC TCC ACT 1169 
Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr 
380 385 

ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG AGC AAC GGA 1208 
Thr Val Thr Gin Thr Pro Ser His Ser Tip Ser Asn Gly 
390 395 400 

CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC TCT GTA ATC 1247 
His Thr Glu Ser He Leu Ser Glu Ser His Ser Val He 
405 410 415 

GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC CCA 1286 
Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro 

420 425 

ACT GGG GGC CCA AGA GGA CGT CTT AAT GGC ACA GGA GGC 1325 
Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly 
430 435 440 

CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA GAA 1364 
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu 
445 450 

ACC CCT GAT TCC TAG CGA GAC TCT CCT CAT AGT GAA AGG 1403 
Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
455 460 465 

TAT GTG TCA GCC ATG ACC ACC CCG GCT CGT ATG TCA CCT 1442 
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 475 480 

GTA GAT TTC CAC ACG CCA AGC TCC CCC AAA TCG CCC CCT 1481 
Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro 

485 490 

TCG GAA ATG TCT CCA CCC GTG TCC AGC ATG ACG GTG TCC 1520 
Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser 
495 500 505 
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ATG CCT TCC ATG GCG GTC AGC CCC TTC ATG GAA GAA GAG 1559 
Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 
510 515 

AGA CCT CTA CTT CTC GTG ACA CCA CCA AGG CTG CGG GAG 1598 
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu 
520 525 530 

AAG AAG TTT GAC CAT CAC CCT CAG CAG TTC AGC TCC TTC 1637 
Lvs Lvs Phe Asp His His Pro Gin Gin Phe Ser Ser Phe 
535 540 545 

CAC CAC AAC CCC GCG CAT GAC AGT AAC AGC CTC CCT GCT 1676 
His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala 

550 555 

AGC CCC TTG AGG ATA GTG GAG GAT GAG GAG TAT GAA ACG 1715 
Ser Pro Leu Arg He Val Glu Asp Glu Glu Tyr Glu Thr 
560 565 570 

ACC CAA GAG TAC GAG CCA GCC CAA GAG CCT GTT AAG AAA 1754 
Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys 
575 580 

CTC GCC AAT AGC CGG CGG GCC AAA AGA ACC AAG CCC AAT 1793 
Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn 
585 590 595 

GGC CAC ATT GCT AAC AGA TTG GAA GTG GAC AGC AAC ACA 1832 
Gly His He Ala Asn Arg Leu Glu Val Asp Ser Asn Thr 
600 605 610 

AGC TCC CAG AGC AGT AAC TCA GAG AGT GAA ACA GAA GAT 1871 
Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp 

615 620 

GAA AGA GTA GGT GAA GAT ACG CCT TTC CTG GGC ATA CAG 1910 
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly He Gin 
625 630 635 

AAC CCC CTG GCA GCC AGT CTT GAG GCA ACA CCT GCC TTC 1949 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe 
640 645 

CGC CTG GCT GAC AGC AGG ACT AAC CCA GCA GGC CGC TTC 1988 
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
650 655 660 

TCG ACA CAG GAA GAA ATC CAG G 2010 
Ser Thr Gin Glu Glu He Gin 
665 669 
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GG GAC AAA CTT TTC CCA AAC CCG ATC CGA GCC CTT GGA 38 
Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly 
15 10 

CCA AAC TCG CCT GCG CCG AGA GCC GTC CGC GTA GAG CGC 77 
pro Asn Sex Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

TCC GTC TCC GGC GAG ATG TCC GAG CGC AAA GAA GGC AGA 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly Arg 

30 35 

GGC AAA GGG AAG GGC AAG AAG AAG GAG CGA GGC TCC GGC 155 
Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
40 45 50 

AAG AAG CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA GCC 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
55 60 

TTG CCT CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG 233 
Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser 
65 70 75 

GCT GCA GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT 272 
Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 90 

TCT GAA TAG TCC TCT CTC AGA TTC AAG TGG TTC AAG AAT 311 
Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 

95 100 

GGG AAT GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT ATC 350 
Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie 
105 HO US 

AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC ATT 389 
Lys lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie 
120 125 

AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC 428 
Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
130 135 140 

AAA GTG ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC 467 
Lys val He Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 
145 150 155 

AAT ATC ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT GGT 506 
Asn He Thr He Val Glu Ser Asn Glu He He Thr Gly 

160 165 
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ATG CCA GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG 545 
Met Pro Ala ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
170 175 180 

TCT CCC ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA AAT 584 
Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

ACT TCT TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA AGC 623 
Thr Ser Ser Ser Thr ser Thr ser Thr Thr Gly Thr Ser 
195 200 205 

CAT CTT GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT 662 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 

GTG AAT GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA 701 
val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 

225 230 

AAC CCC TCG AGA TAC TTG TGC AAG TGC CCA AAT GAG TTT 740 
Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
235 240 245 

ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 779 
Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

TAC AAG CAT CTT GGG ATT GAA TTT ATG GAG GCG GAG GAG 818 
Tyr Lys His Leu Gly He Glu Phe Met Glu Ala Glu Glu 
260 265 270 

CTG TAC CAG AAG AGA GTG CTG ACC ATA ACC GGC ATC TGC 857 
Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC 896 
He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala 

290 295 

TAC TGC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC 935 
Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp 
300 305 310 

CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA AAC AAT ATG 974 
Ara Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met 
315 320 

ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC 1013 
Met Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro 
325 330 335 
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CCC GAG AAT GTC CAG CTG GTG AAT CAA TAC GTA TCT AAA 1052 
Pro Glu Asn Val Gin Leu Val Asn Gin Tyr val Ser Lys 
340 345 350 

AAC GTC ATC TCC ACT GAG CAT ATT GTT GAG AGA GAA GCA 1091 
Asn Val He Ser Ser Glu His He Val Glu Arg Glu Ala 

355 360 

GAG ACA TCC TTT TCC ACC AGT CAC TAT ACT TCC ACA GCC 1130 
Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala 
365 370 375 

CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC 1169 
His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser 
380 385 

TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC 1208 
Trp Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser 
390 395 400 

CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG 1247 
His Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg 
405 410 415 

CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT 1286 
His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 

420 425 

GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG 1325 
Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
430 435 440 

CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT 1364 
His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro 
445 450 

CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC ACC CCG GCT 1403 
His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala 
455 460 465 

CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA AGC TCC CCC 1442 
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 

AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC GTG TCC AGC 1481 
Lys Ser Pro Pro ser Glu Met ser Pro Pro val Ser Ser 

485 490 

ATG ACG GTG TCC ATG CCT TCC ATG GCG GTC AGC CCC TTC 1520 
Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe 
495 500 505 
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ATO GAA GAA GAG AGA CCT CTA CTT CTC GTG ACA CCA CCA 1559 
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro 
510 515 

AGG CTO CGG GAG AAG AAG TTT GAC CAT CAC CCT CAG CAG 1598 
Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin 
520 525 530 

TTC AGC TCC TTC CAC CAC AAC CCC GCG CAT GAC AGT AAC 1637 
Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn 
535 540 545 

AGC CTC CCT GCT AGC CCC TTG AGG ATA GTG GAG GAT GAG 1676 
Ser Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu 

550 555 

GAG TAT GAA ACG ACC CAA GAG TAC GAG CCA GCC CAA GAG 1715 
Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu 
560 565 570 

CCT GTT AAG AAA CTC GCC AAT AGC CGG CGG GCC AAA AGA 1754 
Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg 
575 580 

ACC AAG CCC AAT GGC CAC ATT GCT AAC AGA TTG GAA GTG 1793 
Thr Lys Pro Asn Gly His lie Ala Asn Arg Leu Glu Val 
585 590 595 

GAC AGC AAC ACA AGC TCC CAG AGC AGT AAC TCA GAG AGT 1832 
Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser 
600 605 610 
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GAA ACA GAA GAT GAA AGA GTA GGT GAA GAT ACG CCT TTC 1871 

Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 

615 620 

CTG GGC ATA CAG AAC CCC CTG GCA GCC AGT CTT GAG GCA 1910 
Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
625 630 635 

ACA CCT GCC TTC CGC CTG GCT GAC AGC AGG ACT AAC CCA 1949 
Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro 
640 645 

GCA GGC CGC TTC TCG ACA CAG GAA GAA ATC CAG GCC AGG 1988 
Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin Ala Arg 
650 655 660 

CTG TCT AGT GTA ATT GCT AAC CAA GAC CCT ATT GCT GTA TA 2029 
Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 
665 670 675 

A AACCTAAATA AACACATAGA TTCACCTGTA AAACTTTATT 2070 

TTATATAATA AAGTATTCCA CCTTAAATTA AACAATTTAT TTTATTTTAG 2120 

CAGTTCTGCA AATAGAAAAC AGGAAAAAAA CTTTTATAAA TTAAATATAT 2170 

GTATGTAAAA ATGAAAAAAft AAAAAAAAA' 2199 
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GTGGCTGCGG GGCAATTGAA AAAGAGCCGG CGAGGAGTTC CCCGAAACTT SO 

GTTGGAACTC CGGGCTCGCG CGGAGGCCAG GAGCTGAGCG GCGGCGGCTG 100 

CCGGACGATG GGAGCGTGAG CAGGACGGTG ATAACCTCTC CCCGATCGGG 150 

TTGCGAGGGC GCCGGGCAGA GGCCAGGACG CGAGCCGCCA GCGGCGGGAC 200 

CCATCGACGA CTTCCCGGGG CGACAGGAGC AGCCCCGAGA GCCAGGGCGA 250 

GCGCCCGTTC CAGGTGGCCG GACCGCCCGC CGCGTCCGCG CCGCGCTCCC 300 

TGCAGGCAAC GGGAGACGCC CCCGCGCAGC GCGAGCGCCT CAGCGCGGCC 350 

GCTCGCTCTC CCCATCGAGG GACAAACTTT TCCCAAACCC GATCCGAGCC 400 

CTTGGACCAA ACTCGCCTGC GCCGAGAGCC GTCCGCGTAG AGCGCTCCGT 450 

CTCCGGCGAG ATG TCC GAG CGC AAA GAA GGC AGA GGC AAA 490 
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 
1 5 10 

GGG AAG GGC AAG AAG AAG GAG' CGA GGC TCC GGC AAG AAG 529 
Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 

15 20 

CCG GAG TCC GCG GCG GGC AGC CAG AGC CCA GCC TTG CCT S68 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 35 

CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 <5 

GGT TCC AAA CTA GTC CTT CGG TGT GAA ACC ACT TCT GAA 646 
Glv Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
50 55 60 

TAC TCC TCT CTC AGA TTC AAG TGG TTC AAG AAT GGG AAT 6B5 
Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

GAA TTG AAT CGA AAA AAC AAA CCA CAA AAT ATC AAG ATA 724 
Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lya He 

80 85 
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CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC ATT AAC AAA 763 
Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys 
90 95 100 

GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC AAA GTG 802 
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
105 110 

ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC AAT ATC 841 
lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie 
115 120 125 

ACC ATC GTG GAA TCA AAC GAG ATC ATC ACT GGT ATG CCA 880 
Thr lie Val Glu Ser Asn Glu lie lie Thr Gly Met Pro 
130 135 140 

GCC TCA ACT GAA GGA GCA TAT GTG TCT TCA GAG TCT CCC 919 
Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 

145 ISO 

ATT AGA ATA TCA GTA TCC ACA GAA GGA GCA AAT ACT TCT 958 
lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

TCA TCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175 

GTA AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT 1036 
Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
180 185 190 

GGA GGG GAG TGC TTC ATG GTG AAA GAC CTT TCA AAC CCC 1075 
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro 
195 200 205 

TCG AGA TAC TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT 1114 
Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly 

210 215 

GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AAG 1153 
Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys 
220 225 230 

GCG GAG GAG CTG TAC CAG AAG AGA GTG CTG ACC ATA ACC 1192 
Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr 
235 240 

GGC ATC TGC ATC GCC CTC CTT GTG GTC GGC ATC ATG TGT 1231 
Gly lie Cys He Ala Leu Leu Val Val Gly He Met Cys 
245 250 255 

GTG GTG GCC TAC TGC AAA ACC AAG AAA CAG CGG AAA AAG 1270 
Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lye 
260 265 270 
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CTG CAT GAC CGT CTT CGG CAG AGC CTT CGG TCT GAA CGA 1309 

Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 

275 280 

AAC AAT ATG ATG AAC ATT GCC AAT GGG CCT CAC CAT CCT 1348 
Asn Asn Met Met Asn lie Ala Asn Gly Pro His His Pro 
285 290 295 

AAC CCA CCC CCC GAG AAT GTC CAG CTG GTG AAT CAA TAC 1387 
Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr 
300 305 

GTA TCT AAA AAC GTC ATC TCC AGT GAG CAT ATT GTT GAG 1426 
Val Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu 
310 315 320 

AGA GAA GCA GAG ACA TCC TTT TCC ACC AGT CAC TAT ACT 1465 
Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr 
325 320 335 

TCC ACA GCC CAT CAC TCC ACT ACT GTC ACC CAG ACT CCT 1504 
Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 

340 345 

AGC CAC AGC TGG AGC AAC GGA CAC ACT GAA AGC ATC CTT 1543 
Ser His Ser Trp Ser Asn Gly His Thr Glu Ser lie Leu 
350 355 360 

TCC GAA AGC CAC TCT GTA ATC GTG ATG TCA TCC GTA GAA 15 82 
Ser Glu Ser His Ser Val lie Val Met Ser Ser Val Glu 
365 370 

AAC AGT AGG CAC AGC AGC CCA ACT GGG GGC CCA AGA GGA 1621 
Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly 
375 380 385 

CGT CTT AAT GGC ACA GGA GGC CCT CGT GAA TGT AAC AGC 1660 
Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser 
390 395 400 

TTC CTC AGG CAT GCC AGA GAA ACC CCT GAT TCC TAC CGA 1699 
Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr Arg 

405 410 

GAC TCT CCT CAT AGT GAA AGG TAT GTG TCA GCC ATG ACC 1738 
Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr 
415 420 425 

ACC CCG GCT CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA 1777 
Thr Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro 
430 435 

AGC TCC CCC AAA TCG CCC CCT TCG GAA ATG TCT CCA CCC 1816 
Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro Pro 
440 445 450 
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GTG TCC AGC ATG ACG GTG TCC AAG CCT TCC ATG GCG GTC 1855 
Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 
455 460 465 

AGC CCC TTC ATG GAA GAA GAG AGA CCT CTA CTT CTC GTG 1894 
Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Ijou Val 

470 475 

ACA CCA CCA AGG CTG CGG GAG AAG AAG TTT GAC CAT CAC 1933 
Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His Hia 
480 485 490 

CCT CAG CAG TTC AGC TCC TTC CAC CAC AAC CCC GCG CAT 1972 
Pro Gin Gin Phe Ser Ser Phe Hia His Asn Pro Ala His 
495 500 

GAC AGT AAC AGC CTC CCT GCT AGC CCC TTG AGG ATA GTG 2011 
Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
505 510 515 

GAG GAT GAG GAG TAT GAA ACG ACC CAA GAG TAG GAG CCA 2050 
Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro 
520 525 530 

GCC CAA GAG CCT GTT AAG AAA CTC GCC AAT AGC CGG CGG 2089 
Ala Gin Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg 

535 540 

GCC AAA AGA ACC AAG CCC AAT GGC CAC ATT GCT AAC AGA 2128 
Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn Arg 
545 550 S55 

TTG GAA GTG GAC AGC AAC ACA AGC TCC CAG AGC AGT AAC 2167 
Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 

TCA GAG AGT GAA ACA GAA GAT GAA AGA GTA GGT GAA GAT 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 

ACG CCT TTC CTG GGC ATA CAG AAC CCC CTG GCA GCC AGT 2245 
Thr Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser 
585 590 595 

CTT GAG GCA ACA CCT GCC TTC CGC CTG GCT GAC AGC AGG 2284 
£eu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 

600 605 

ACT AAC CCA GCA GGC CGC TTC TCG ACA CAG GAA GAA ATC 2323 
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie 
610 615 620 

CAG GCC AGG CTG TCT AGT GTA ATT GCT AAC CAA GAC CCT 2362 
Gin Ala Arg Leu Ser Ser Val He Ala Asn Gin Asp Pro 
625 630 
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ATT GCT GTA TAAAACCTA AATAAACACA TAGATTCACC TGTAAAACTT 2410 
lie Ala Val 
635 637 

TATTTTATAT AATAAAGTAT TCCACCTTAA ATTAAACAAT TTATTTTATT 2460 



TTAGCAGTTC TGCAAATAAA AAAAAAAAAA 2490 
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GCGCCTGCCT CCAACCTGCG GGCGGGAGGT GGGTGGCTGC GGGGCAATTG SO 

AAAAAGAGCC GGCGAGGAGT TCCCCGAAAC TTGTTGGAAC TCCGGGCTCG 100 

CGCGGAGGCC AGGAGCTGAG CGGCGGCGGC TGCCGGACGA TGGGAGCGTG ISO 

AGCAGGACGG TGATAACCTC TCCCCGATCG GGTTGCGAGG GCGCCGGGCA 200 

GAGGCCAGGA CGCGAGCCGC CAGCGGCGGG ACCCATCGAC GACTTCCCGG 250 

GGCGACAGGA GCAGCCCCGA GAGCCAGGGC GAGCGCCCGT TCCAGGTGGC 300 

CGGACCGCCC GCCGCGTCCG CGCCGCGCTC CCTGCAGGCA ACGGGAGACG 350 

CCCCCGCGCA GCGCGAGCGC CTCAGCGCGG CCGCTCGCTC TCCCCATCGA 400 

GGGACAAACT TTTCCCAAAC CCGATCCGAG CCCTTGGACC AAACTCGCCT 450 

GCGCCGAGAG CCGTCCGCGT AGAGCGCTCC GTCTCCGGCG AG ATG 495 

Met 
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AAA AAC AAA CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA 7 68 
Lys Asn Lys Pro Gin Asa lie Lys lie Gin Lys Lys Pro 
80 85 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 807 
Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

GAT TCT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 846 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 

110 us 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 885 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 924 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 963 
Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

GTA TCC ACA GAA GGA GCA AAT ACT TCT TCA TCT ACA TCT 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1041 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 

175 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1080 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC 1197 
Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

TTT CTG TCT CTG CCT GAA TAGGA GCATGCTCAG TTGGTGCTGC 1240 
Phe Leu Ser Leu Pro Glu 

240 241 

TTTCTTGTTG CTGCATCTCC CCTCAGATTC CACCTAGAGC TAGATGTGTC 1290 
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TTACCAGATC TAATATTGAC TGCCTCTGCC TGTCGCATGA GAACATTAAC 1340 
AAAAGCAATT GTATTACTTC CTCTGTTCGC GACTAGTTGG CTCTGAGATA 1390 
CTAATAGGTG TGTGAGGCTC CGGATGTTTC TGGAATTGAT ATTGAATGAT 1440 
GTGATACAAA TTGATAGTCA ATATCAAGCA GTGAAATATG ATAATAAAGG 1490 
CATTTCAAAG TCTCACTTTT ATTGATAAAA TAAAAATCAT TCTACTGAAC 1540 
AGTCCATCTT CTTTATACAA TGACCACATC CTGAAAAGGG TGTTGCTAAG 1590 
CTGTAACCGA TATGCACTTG AAATGATGGT AAGTTAATTT TGATTCAGAA 1640 
TGTGTTATTT GTCACAAATA AACATAATAA AAGGAGTTCA GATGTTTTTC 1690 
TTCATTAACC AAAAAAAAAA AAAAA 1715 

FIG..4C 
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GAGGCGCCTG CCTCCAACCT GCGGGCGGGA GGTGGGTGGC TGCGGGGCAA 50 
TTGAAAAAGA GCCGGCGAGG AGTTCCCCGA AACTTGTTGG AACTCCGGGC 100 
TCGCGCGGAG GCCAGGAGCT GAGCGGCGGC GGCTGCCGGA CGATGGGAGC 150 
GTGAGCAGGA CGGTGATAAC CTCTCCCCGA TCGGGTTGCG AGGGCGCCGG 200 
GCAGAGGCCA GGACGCGAGC CGCCAGCGGC GGGACCCATC GACGACTTCC 250 
CGGGGCGACA GGAGCAGCCC CGAGAGCCAG GGCGAGCGCC CGTTCCAGGT 300 
GGCCGGACCG CCCGCCGCGT CCGCGCCGCG CTCCCTGCAG GCAACGGGAG 350 
ACGCCCCCGC GCAGCGCGAG CGCCTCAGCG CGGCCGCTCG CTCTCCCCAT 400 
CGAGGGACAA ACTTTTCCCA AACCCGATCC GAGCCCTTGG ACCAAACTCG 450 



CCTGCGCCGA GAGCCGTCCG CGTAGAGCGC TCCGTCTCCG GCGAG AT 497 

Met 
1 

G TCC GAG CGC AAA GAA GGC AGA GGC AAA GGG AAG GGC AAG 537 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

AAG AAG GAG CGA GGC TCC GGC AAG AAG CCG GAG TCC GCG 576 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

GCG GGC AGC CAG AGC CCA GCC TTG CCT CCC CAA TTG AAA 615 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

GAG AXG AAA AGC CAG GAA TCG GCT GCA GGT TCC AAA CTA 654 
Glti Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 

45 50 

GTC CTT CGG TGT GAA ACC ACT TCT GAA TAC TCC TCT CTC 693 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA 732 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
70 75 



FIG.. 5 A 

SUBSTITUTE SHEET (RULE 26) 



BNSDOCIO <WO 9939729A2J. > 



WO 99/39729 PCT/US99/02390 

19/29 

AAA AAC AAA CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA 771 
Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

GGG AAG TCA GAA CTT CGC ATT AAC AAA GCA TCA CTG GCT 810 
Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
95 100 105 

GAT TCT GGA GAG TAT ATG TGC AAA GTG ATC AGC AAA TTA 849 
Asp Ser Gly Glu Tyr Met. Cys Lys Val lie Ser Lys Leu 

110 115 

GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG GAA 888 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

TCA AAC GAG ATC ATC ACT GGT ATG CCA GCC TCA ACT GAA 927 
Ser Asn Glu He He Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

GGA GCA TAT GTG TCT TCA GAG TCT CCC ATT AGA ATA TCA 966 
Gly Ala Tyr Val Ser Ser Glu Ser Pro He Arg He Ser 
145 150 155 

GTA TCC ACA GAA GGA GCA AAT ACT TCT TCA TCT ACA TCT 1005 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA AAA TGT GCG 1044 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 

17S 180 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1083 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA 1161 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

AAC TAC GTA ATG GCC AGC TTC TAC AAG GCG GAG GAG CTG 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

TAC CAG AAG AGA GTG CTG ACC ATA ACC GGC ATC TGC ATC 1239 
Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys He 

240 245 

GCC CTC CTT GTG GTC GGC ATC ATG TGT GTG GTG GCC TAC 1278 
Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr 
250 255 260 

FIG. SB 
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TGC AAA ACC AAG AAA CAG CGG AAA AAG CTG CAT GAC CGT 1317 
Cya Lys Thr Lya Lys Gin Arg Lys Lys Lou His Aap Arg 
265 270 

CTT CGG CAG AGC CTT CGG TCT GAA CGA AAC AAT ATG ATG 1356 
Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 
275 280 285 

AAC ATT GCC AAT GGG CCT CAC CAT CCT AAC CCA CCC CCC 1395 
Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

GAG AAT GTC CAG CTG GTG AAT CAA TAC GTA TCT AAA AAC 1434 
Glu Aan Val Gin Leu Val Asn Gin Tyr Val Ser Lya Asn 

305 310 

GTC ATC TCC AGT GAG CAT ATT GTT GAG AGA GAA GCA GAG .1473 
Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu 
315 320 325 

ACA TCC TTT TCC ACC AGT CAC TAT ACT TCC ACA GCC CAT 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

CAC TCC ACT ACT GTC ACC CAG ACT CCT AGC CAC AGC TGG 1551 
His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
340 345 350 

AGC AAC GGA CAC ACT GAA AGC ATC CTT TCC GAA AGC CAC 1590 
Ser Asn Gly Hi3 Thr Glu Ser He Leu Ser Glu Ser His 
355 360 365 

TCT GTA ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC 1629 
Ser Val He Val Met Ser Ser Val Glu Asn Ser Arg His 

370 375 

AGC AGC CCA ACT GGG GGC CCA AGA GGA CGT CTT AAT GGC 1668 
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly " 
380 385 390 

ACA GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT 1707 
Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

GCC AGA GAA ACC CCT GAT TCC TAC CGA GAC TCT CCT CAT 1746 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 

405 410 415 v. a-yy.-N" 

AGT GAA AGG TAAAA CCGAAGGCAA AGCTACTGCA GAGGAGAAAC 1790 
Ser Glu Arg 
420 

FIG.-5C 
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TCAGTCAGAG AATCCCTGTG AGCACCTGCG GTCTCACCTC AGGAAATCTA 1840 
CTCTAATCAG AATAAGGGGC GGCAGTTACC TGTTCTAGGA GTGCXCCTAG 1890 
TTGATGAAGT CATCTCTTTG TTTGACGGAA CTTATTTCTT CTGAGCTTCT 1940 
CTCGTCGTCC CAGTGACTGA CAGGCAACAG ACTCTTAAAG AGCTGGGATG 1990 
CTTTGATGCG GAAGGTGCAG CACATGGAGT TTCCAGCTCT GGCCATGGGC 2040 
TCAGACCCAC TCGGGGTCTC AGTGTCCTCA GTTGTAACAT TAGAGAGATG 2090 
GCATCAATGC TTGATAAGGA CCCTTCTATA ATTCCAATTG CCAGTTATCC 2140 
AAACTCTGAT TCGGTGGTCG AGCTGGCCTC GTGTTCTTAT CTGCTAACCC 2190 
TGTCTTACCT TCCAGCCTCA GTTAAGTCAA ATCAAGGGCT ATGTCATTGC 2240 
TGAATGTCAT GGGGGGCAAC TGCTTGCCCT CCACCCTATA GTATCTATTT 2290 
TATGAAATTC CAAGAAGGGA TGAATAAATA AATCTCTTGG ATGCTGCGTC 2340 
TGGCAGTCTT CACGGGTGGT TTTCAAAGCA GAAAAAAAAA AAAAAAAAAA 2390 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 2431 

FIG..5D 
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1 GGGTACCATGGGTCGGTGAGCGCGTTTCCCGCCTGAGCGCAACTAGCGGC 

51 GGGTCGTGGGCACCTCCAGAAAAGATCCCGCACCATCCTCCAGGATCCAA 

101 TGGCCTTGGAGAGAGGGCTGCAGGGCCCACGGACATTGCTGACTCTTCAG 

151 AACGTGCTGACATGGAGCCAGGTAGACTGAAATTATCATGTGTCCAAATT 

2 01 AAAATTGCATACTTCAAGGATTATTTGAAGGACTATTCTTAGACCCTTTT 

2 51 AAGAAGATTTAAAGAAAAACCACTCGGCCCTGAGTGCGGCGAGGACCCTG 

3 01 TTTGTGGATGTGGAGGAGCGCGGGCCGGAGGCCATGGACGTGAAGGAGAG 

± M D V K E R 

351 GAAGC CTTACCGCTCGC T G AC CCGG C GC CGC G AC GCCG AG CGCCGCT AC A 

7 RPYRSIiTRRRDAERRYT 

4 01 CCAGCTCGTCCGCGGACAGCGAGGAGGGCAAAGCCCCGCAGAAATCGTAC 
24 S S SA DS EEGKAP Q KS Y 

4 51 AGCTCCAGCGAGACCCTGAAGGCCTACGACCAGGACGCCCGCCTAGCCTA 

40 S S S ETLK AYDQDARLAY 

501 TGGCAGCCGCGTCAAGGACATTGTGCCGCAGGAGGCCGAGGAATTCTGCC 

57 GSRV'KDIVPQEAEEFCR 

551 GCACAGGTGCCAACTTCACCCTGCGGGAGCTGGGGCTGGAAGAAGTAACG 

74 T GANFTLREIiGLEEVT 

601 CCCCCTCACGGGACCCTGTACCGGACAGACATTGGCCTCCCCCACTGCGG 
gOPPHGT.LYRTDIGLPHCG 

651 CTACTCCATGGGGGCTGGCTCTGATGCCGACATGGAGGCTGACACGGTGC 

107 YSMGAGSDADMEADTVL 

7 01 TGTCCCCTGAGCACCCCGTGCGTCTGTGGGGCCGGAGCACACGGTCAGGG 

124 S PEHPVRLWGRSTRS G 

7 51 C GC AG CTC CT GCC T GT C C AGCC GGG CC AATTCCAATCTC AC ACT C AC CG A 
140 RSSCLSSRANSNLTLTD 

8 01 CACCGAGCATGAAAACACTGAGACTGATCATCCGGGCGGCCTGCAGAACC 
157 TEHENTETDHPGGLQNH 

8 51 ACGCGCGGCTCCGGACGCCGCCGCCGCCGCTCTCGCACGCCCACACCCCC 
174 ARLRTPPPPLSHAHTP 

9 01 AACCAGCACCACGCGGCCTCCATTAACTCCCTGAACCGGGGCAACTTCAC 
190 NQHHAASINSLNRGNFT 

951 GCCGAGGAGCAACCCCAGCCCGGCCCCCACGGACCACTCGCTCTCCGGAG 

207 PRSNPS PAPTDHS LS GE 

10 01 AGCCCCCTGCCGGCGGCGCCCAGGAGCCTGCCCACGCCCAGGAGAACTGG 

224 PPAGGAQEPAHAQENW 

1051 CTGCTCAACAGCAACATCCCCCTGGAGACCAGAAACCTAGGCAAGCAGCC 

240 LiLNSNI PLETRNLGKQ P 
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1101 ATTCCTAGGGACATTGCAGGACAACCTCATTGAGATGGACATTCTCGGCG 
257 FLGTLQDNLIEMDI LGA 

1151 CCTCCCGCCATGATGGGGCTTACAGTGACGGGCACTTCCTCTTCAAGCCT 
274 SRHDGAYSDGHFLFKP 

12 01 GGAGGCACCTCCCCGCTCTTCTGCACCACATCACCAGGGTACCCACTGAC 
290 GGTSPLFCTTSPGYPLT 

12 51 GTGCAGCACAGTGTACTCTCCTCCGCCCCGACCCCTGCCCCGCAGCACCT 
307 S STVYSPPPRPLPRSTF 

13 01 TCGCCCGGCCGGCCTTTAACCTCAAGAAGCCCTCCAAGTACTGTAACTGG 
324 ARPAFNLKKPSKYCNW 

13 51 AAGTGCGCAGCCCTGAGCGCCATCGTCATCTCAGCCACTCTGGTCATCCT 
34 0 K C AALSAIVISA TLVIL 

14 01 GCTGGCATACTTTGTGGCCATGCACCTGTTTGGCCTAAACTGGCACCTGC 
357 L A Y F V A M HLFGLNWHLQ 

14 51 AGCCGATGGAGGGGCAGATGTATGAGATCACGGAGGACACAGCCAGCAGT 
374 PMEGQMYEITEDTAS S 

15 01 TGGCCTGTGCCAACCGACGTCTCCCTATACCCCTCAGGGGGCACTGGCTT 
390 WPVPTDVSLYPSGGTGL 

1551 AGAGACCCCTGACAGGAAAGGCAAAGGAACCACAGAAGGAAAGCCCAGTA 
407 E TPDRKGKGTTEGKP S S 

16 01 GTTTCTTTCCAGAGGACAGTTTCATAGATTCTGGAGAAATTGATGTGGGA 
424 FFPEDSFIDS. GEIDVG 

1651 AGGCGAGCTTCCCAGAAGATTCCTCCTGGCACTTTCTGGAGATCTC AAGT 
440 RRASQKIPPGTFWRSQV 

• 

17 01 GTTCATAGACCATCCTGTGCATCTGAAATTCAATGTGTCTCTGGGAAAGG 
457 FIDHPVHLKFNVSLGKA 

17 51 CAGCCCTGGTTGGCATTTATGGCAGAAAAGGCCTCCCTCCTTCACATACA 
474 A LVGIYGRKGLP PSHT 

18 01 CAGTTTGACTTTGTGGAGCTGCTGGATGGCAGGAGGCTCCTAACCCAGGA 
490 QFDFVELLDGRRLLT QE 

18 51 GGCGCGGAGCCTAGAGGGGACCCCGCGCCAGTCTCGGGGAACTGTGCCCC 
507 ARSLEGTPRQSRGTVPP 

19 01 CCTCCAGCCATGAGACAGGCTTCATCCAGTATTTGGATTCAGGAATCTGG w . a . & ^ 
524 S SHETGFIQYLDSGIW 

19 51 CACTTGGCTTTTTACAATGACGGAAAGGAGTCAGAAGTGGTTTCCTTTCT 
540 HLAFYNDGKESEVVSFL 

FIG..7B 
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2 0 01 CACCACTGCCATTGCCTTGCCTCCCCGATTGAAAGAGATGAAAAGCCAGG 
557 T TAIALPPRLKEMKSQE 

2 051 AATCGGCTGCAGGTTCCAAACTAGTCCTTCGGTGTGAAACCAGTTCTGAA 
574 S AAGSKLVIiRCETSSE 

2101 TACTCCTCTCTCAGATTCAAGTGGTTCAAGAATGGGAATGAATTGAATCG 

590 YSSLRFKWFKNGNELNR 

2151 AAAAAACAAACCACAAAATATCAAGATACAAAAAAAGCCAGGGAAGTCAG 
607 RNKPQNIKIQKKPGKSE 

2 2 01 AACTTCGCATTAACAAAGCATCACTGGCTGATTCTGGAGAGTATATGTGC 
624 LRIN KASLADSGEYMC 

2 2 51 AAAGTGATCAGCAAATTAGGAAATGACAGTGCCTCTGCCAATATCACCAT 
64 0KV I S KLG ND S A S AN I T I 

2 3 01 CGTGGAATCAAACGAGATCATCACTGGTATGCCAGCCTCAACTGAAGGAG 
657 VESNEIITGMPASTEGA 

2 3 51 CATATGTGTCTTCAGAGTCTCCCATTAGAATATCAGTATCCACAGAAGGA 
674 yVSSESPIRISVSTEG 

2 4 01 GCAAATACTTCTTCATCTACATCTACATCCACCACTGGGACAAGCCATCT 

690 ANTSSSTSTSTTGT ^xtXjBX,: 

2 4 51 T G T AAAAT GT G C G G AG AAG G AG AAAACTT TCT GT GT G AATGG AG GGG AGT 

707 :; ?y^;l®:|^ 

2 5 01 GCTTCATGGTGAAAGACCTTTCAAACCCCTCGAGATACTTGTGCAAGTGC 

724 

2 5 51 CCAAATGAGTTTACTGGTGATCGCTGCCAAAACTACGTAATGGCCAGCTT 

74 0 * : :H.::N .;f ^M?/ 1 :^ QNYVMASF 

2 6 01 CTACAGTACGTCCACTCCCTTTCTGTCTCTGCCTGAATAGGAGCATGCTC 
757 YSTSTPFLSLPE 

2 6 51 AGTTGGTGCTGCTTTCTTGTTGCTGCATCTCCCCTCAGATTCCACCTAGA 

27 01 GCTAGATGTGTCTTACCAGATCTAATATTGACTGCCTCTGCCTGTCGCAT 

27 51 GAGAACATTAACAAAAGCAATTGTATTACTTCCTCTGTTCGCGACTAGTT 

2 8 01 GGCTCTGAGATACTAATAGGTGTGTGAGGCTCCGGATGTTTCTGGAATTG 

2 8 51 ATATTGAATGATGTGATACAAATTGATAGTCAATATCAAGCAGTGAAATA 

2 9 01 TGATAATAAAGGCATTTCAAAGTCTCACTTTTATTG AT AAAAT AAAAATC 

2 9 51 ATTCTACTGAACAGTCCATCTTCTTTATACAATGACCACATCCTGAAAAG 

3 0 01 GGTGTTGCTAAGCTGTAACCGATATGCACTTGAAATGATGGTAAGTTAAT 
3 0 51 TTTGATTCAGAATGTGTTATTTGTCACAAATAAACATAATAAAAGGAAAA 
3101 AAAAAAAAAAA 
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Sequence Listing 

<110> Genentech, Inc. 

<120> Use of Heregulin as a Growth Factor 

<130> P1145PCT 

5 <141> 1999-02-03 

<150> US 09/020, 598 
<151> 1998-02-04 

<160> 14 

<210> 1 

10 <211> 669 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro Ser Arg Asp 
l r 1 5 10 15 

Lys Leu Phe Pro Asn Pro He Arg Ala Leu Gly Pro Asn Ser Pro 
20 25 30 

Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu Met 
35 40 45 

2 0 Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys 

50 55 60 

Glu Arg. Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin 
65 70 75 

Ser Pro Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
25 80 85 90 

Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
95 100 105 

Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu 
110 115 120 

3 0 Leu Asn Arg Lys Asn Lys Pro Gin Asn He Lys He Gin Lys Lys 

125 130 135 

Pro Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala Asp 
140 145 150 

Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu Gly Asn Asp 
35 155 160 165 

Ser Ala Ser Ala Asn He Thr He Val Glu Ser Asn Glu He He 
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170 175 180 

Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 
185 190 195 

Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
5 200 205 210 

Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys 
215 220 225 

Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
230 235 240 

10 Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys 

245 250 255 

Cys Gin Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro 
260 265 270 

Met Lys Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 
15 275 280 285 

Arg Val Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu Val Val 
290 295 300 

Gly lie Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg 
305 310 315 

2 0 Lys Lys Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg 

320 325 330 

Asn Asn Met Met Asn He Ala Asn Gly Pro His His Pro Asn Pro 
335 340 345 

Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn 
25 350 355 360 

Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu Thr Ser 
365 370 375 

Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr 
380 385 390 

30 Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu 

395 400 405 

Ser lie Leu Ser Glu Ser His Ser Val lie Val Met Ser Ser Val 
410 415 420 

Glu Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg 
35 425 430 435 

Leu Asn Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg 
440 445 450 
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His Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser 
455 460 465 

Glu Arg Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 475 480 

5 Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu 

485 490 495 

Met Ser Pro Pro Val Ser Ser Met Thr Val Ser Met Pro Ser Met 
500 505 510 

Ala Val Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val 
10 515 520 525 

Thr Pro Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin 
530 535 540 

Gin Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser 
545 550 555 

15 Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu 

560 565 570 

Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys Leu 
575 580 585 

Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His lie 
20 590 595 600 

Ala Asn Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser 
605 610 615 

Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr 
620 625 630 

25 Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 

635 640 645 

Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly 
650 655 660 

Arg Phe Ser Thr Gin Glu Glu lie Gin 
30 665 669 

<210> 2 * 
<211> 2226 
<212> DNA 

<213> Homo sapiens 
35 <400> 2 

99 9 c g cga gcg cct cag cgc ggc cgc teg etc tec ccc 38 
Ala Arg Ala Pro Gin Arg Gly Arg Ser Leu Ser Pro 
1 5 10 

teg agg gac aaa ctt ttc cca aac ccg ate cga gec ctt 77 
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Ser Arg Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu 
15 20 25 

gga cca aac teg cct gcg ccg aga gec gtc cgc gta gag 116 
Gly Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu 
5 30 35 

cgc tec gtc tec ggc gag atg tec gag cgc aaa gaa ggc 155 
Arg Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly 
40 45 50 

aga ggc aaa ggg aag ggc aag aag aag gag cga ggc tec 194 
10 Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser 

55 60 

ggc aag aag ccg gag tec gcg gcg ggc age cag age cca 233 
Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro 
65 70 75 

15 gee ttg cct ccc cga ttg aaa gag atg aaa age cag gaa 272 
Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu 
80 85 90 

teg get gca ggt tec aaa eta gtc ctt egg tgt gaa acc 311 
Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr 
20 95 100 

age tct gaa tac tec tct etc aga ttc aag tgg ttc aag 350 
Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys 
105 110 115 

aat ggg aat gaa ttg aat cga aaa aac aaa cca caa aat 389 
2 5 Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn 

120 125 

ate aag ata caa aaa aag cca ggg aag tea gaa ctt cgc 428 
lie Lys lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg 
130 135 140 

30 att aac aaa gca tea ctg get gat tct gga gag tat atg 467 
He Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met 
145 150 155 

tgc aaa gtg ate age aaa tta gga aat gac agt gee tct 506 
Cys Lys Val He Ser Lys Leu Gly Asn Asp Ser Ala Ser 
35 160 165 

gee aat ate acc ate gtg gaa tea aac gag ate ate act 545 
Ala Asn He Thr He Val Glu Ser Asn Glu He lie Thr 
170 175 180 

ggt atg cca gee tea act gaa gga gca tat gtg tct tea 584 
40 Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

185 190 

gag tct ccc att aga ata tea gta tec aca gaa gga gca 623 
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Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala 
195 200 205 

aat act tct tea tct aca tct aca tec acc act ggg aca 662 

Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr 

5 210 215 220 

age cat ctt gta aaa tgt gcg gag aag gag aaa act ttc 701 

Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe 
225 230 

tgt gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt 74 0 

10 Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu 
235 240 245 

tea aac ccc teg aga tac ttg tgc aag tgc caa cct gga 779 

Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Gin Pro Gly 
250 255 

15 ttc act gga gca aga tgt act gag aat gtg ccc atg aaa 818 

Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys 
260 265 270 

gtc caa aac caa gaa aag gcg gag gag ctg tac cag aag 857 

Val Gin Asn Gin Glu Lys Ala Glu Glu Leu Tyr Gin Lys 

20 275 280 285 

aga gtg, ctg acc ata acc ggc ate tgc ate gec etc ctt 896 

Arg Val Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu 
290 295 

gtg gtc ggc ate atg tgt gtg gtg gec tac tgc aaa acc 935 

2 5 Val Val Gly He Met Cys Val Val Ala Tyr Cys Lys Thr 
300 305 310 

aag aaa cag egg aaa aag ctg cat gac cgt ctt egg cag 974 

Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
315 320 

30 age ctt egg tct gaa cga aac aat atg atg aac att gec 1013 

Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn He Ala 
325 330 335 

aat ggg cct cac cat cct aac cca ccc ccc gag aat gtc 1052 

Asn Gly Pro His His Pro Asn Pro Pro Pro Glu Asn Val 

35 340 345 350 

cag ctg gtg aat caa tac gta tct aaa aac gtc ate tec 1091 

Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He Ser 
355 360 

agt gag cat att gtt gag aga gaa gca gag aca tec ttt 1130 

40 Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe 
365 370 375 

tec acc agt cac tat act tec aca gee cat cac tec act 1169 
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Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr 
380 385 

act gtc acc cag act cct age cac age tgg age aac gga 1208 
Thr Val Thr Gin Thr Pro Ser His Ser Trp Ser Asn Gly 
5 390 395 400 

cac act gaa age ate ctt tec gaa age cac tct gta ate 1247 
His Thr Glu Ser lie Leu Ser Glu Ser His Ser Val lie 
405 410 415 

gtg atg tea tec gta gaa aac agt agg cac age age cca 1286 
10 Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro 

420 425 

act ggg ggc cca aga gga cgt ctt aat ggc aca gga ggc 1325 
Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly 
430 435 440 

15 cct cgt gaa tgt aac age ttc etc agg cat gec aga gaa 1364 
Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu 
445 450 

acc cct gat tec tac cga gac tct cct cat agt gaa agg 1403 
Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
20 455 460 465 

tat gtg tea gec atg acc acc ccg get cgt atg tea cct 1442 
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro 
470 475 480 

gta gat ttc cac acg cca age tec ccc aaa teg ccc cct 1481 
2 5 Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro 

485 490 

teg gaa atg tct cca ccc gtg tec age atg acg gtg tec 1520 
Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr Val Ser 
495 500 505 

30 atg cct tec atg gcg gtc age ccc ttc atg gaa gaa gag 1559 
Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 
510 515 

aga cct eta ctt etc gtg aca cca cca agg ctg egg gag 1598 
Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu 
35 520 525 530 

aag aag ttt gac cat cac cct cag cag ttc age tec ttc 1637 
Lys Lys Phe Asp His His Pro Gin Gin Phe Ser Ser Phe 
535 540 545 

cac cac aac ccc gcg cat gac agt aac age etc cct get 1676 
4 0 His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro Ala 

550 555 

age ccc ttg agg ata gtg gag gat gag gag tat gaa acg 1715 
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Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr 
560 565 570 

acc caa gag tac gag cca gcc caa gag cct gtt aag aaa 1754 
Thr Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys 
5 575 580 

etc gcc aat age egg egg gcc aaa aga acc aag ccc aat 17 93 
Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr Lys Pro. Asn 
585 590 595 

ggc cac att get aac aga ttg gaa gtg gac age aac aca 1832 
10 Gly His lie Ala Asn Arg Leu Glu Val Asp Ser Asn Thr 
600 605 610 

age tec cag age agt aac tea gag agt gaa aca gaa gat 1871 
Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp 
615 620 

15 gaa aga gta ggt gaa gat acg cct ttc ctg ggc ata cag 1910 
Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Gly lie Gin 
625 630 635 

aac ccc ctg gca gcc agt ctt gag gca aca cct gcc ttc 1949 
Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe 
20 640 645 

cgc ctg get gac age agg act aac cca gca ggc cgc ttc 1988 
Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
650 655 660 

teg aca cag gaa gaa ate cag g ccaggctgtc tagtgtaatt 203 0 
25 Ser Thr Gin Glu Glu lie Gin 
665 669 

gctaaccaag accctattgc tgtataaaac ctaaataaac acatagattc 2080 

acctgtaaaa ctttatttta tataataaag tattccacct taaattaaac 2130 

aatttatttt attttagcag ttctgcaaat agaaaacagg aaaaaaactt 2180 

30 ttataaatta aatatatgta tgtaaaaatg aaaaaaaaaa aaaaaa 2226 

<210> 3 
<211> 675 
<212> PRT 

<213> Homo sapiens 
35 <400> 3 

Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly Pro Asn Ser 
1 5 10 15 

Pro Ala Pro Arg Ala Val Arg Val Glu Arg Ser Val Ser Gly Glu 
20 25 30 

40 Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
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35 40 45 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
50 55 60 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
5 65 70 75 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
80 85 90 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
95 100 105 

10 Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 

110 115 120 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
125 130 135 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 
15 140 145 150 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 
155 160 165 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
170 175 180 

20 Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 

185 190 195 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
200 205 210 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
25 215 220 225 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
230 235 240 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
245 250 255 

30 Met Ala Ser Phe Tyr Lys His Leu Gly lie Glu Phe Met Glu Ala 

260 265 270 

Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr Gly He Cys 
275 280 285 

He Ala Leu Leu Val Val Gly He Met Cys Val Val Ala Tyr Cys 
35 .290 295 300 

Lys Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg Leu Arg Gin 
305 310 315 



-8- 



WO 99/39729 PCT/US99/02390 

Ser Leu Arg Ser Glu Arg Asn Asn Met Met Asn He Ala Asn Gly 
320 325 330 

Pro His His Pro Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn 
335 340 345 

5 Gin Tyr Val Ser Lys Asn Val He Ser Ser Glu His He .Val Glu 

350 355 360 

Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr 
365 370 375 

Ala His His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 
10 380 385 390 

Ser Asn Gly His Thr Glu Ser He Leu Ser Glu Ser His, Ser Val 
395 400 405 

He Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser Pro Thr 
410 415 420 

15 Gly Gly Pro Arg Gly Arg Leu Asn Gly Thr Gly Gly Pro Arg Glu 

425 430 435 

Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr 
440 445 450 

Arg Asp Ser Pro His Ser Glu Arg Tyr Val Ser Ala Met Thr Thr 
20 455 460 465 

Pro Ala Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro 
470 475 480 

Lys Ser Pro Fro Ser Glu Met Ser Pro Pro Val Ser Ser Met Thr 
485 490 495 

25 Val Ser Met Pro Ser Met Ala Val Ser Pro Phe Met Glu Glu Glu 

500 505 510 

Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu Arg Glu Lys Lys 
515 520 525 

Phe Asp His His Pro Gin Gin Phe Ser Ser Phe His His Asn Pro 
30 530 535 540 

Ala His Asp Ser Asn Ser Leu Pro Ala Ser Pro Leu Arg lie Val 
545 550 555 

Glu Asp Glu Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin 
560 565 570 

3 5 Glu Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg Thr 

575 580 585 

Lys Pro Asn Gly His He Ala Asn Arg Leu Glu Val Asp Ser Asn 
590 595 600 
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Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu 
605 610 615 

Arg Val Gly Glu Asp Thr Pro Phe Leu Gly lie Gin Asn Pro Leu 
620 625 630 

5 Ala Ala Ser Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser 

635 640 645 

Arg Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie Gin 
650 655 660 

Ala Arg Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 
10 665 670 675 

<210> 4 
<211> 2199 
<212> DNA 

<213> Homo sapiens 
15 <400> 4 

99 9&c aaa ctt ttc cca aac ccg acc cga gcc ctt gga 38 
Asp Lys Leu Phe Pro Asn Pro lie Arg Ala Leu Gly 
15 10 

cca aac teg cct gcg ccg aga gcc gtc cgc gta gag cgc 77 
2 0 Pro Asn Ser Pro Ala Pro Arg Ala Val Arg Val Glu Arg 
15 20 25 

tec gtc tec ggc gag atg tec gag cgc aaa gaa ggc aga 116 
Ser Val Ser Gly Glu Met Ser Glu Arg Lys Glu Gly Arg 
30 35 

2 5 ggc aaa ggg aag ggc aag aag aag gag cga ggc tec ggc 15 5 

Gly Lys Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly 
40 45 50 

aag aag ccg gag tec gcg gcg ggc age cag age cca gcc 194 
Lys Lys Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala 
30 55 60 

ttg cct ccc caa ttg aaa gag atg aaa age cag gaa teg 233 
Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser 
65 70 75 

get gca ggt tec aaa eta gtc ctt egg tgt gaa acc agt 2 72 

3 5 Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

80 85 90 

tct gaa tac tec tct etc aga ttc aag tgg ttc aag aat 311 
Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
95 100 

40 999 aa t gaa ttg aat cga aaa aac aaa cca caa aat ate 3 50 
Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie 
105 110 115 
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aag ata caa aaa aag cca ggg aag tea gaa ctt cgc att 389 

Lys lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie 
120 125 

aac aaa gca tea ctg get gat tct gga gag tat atg tgc 428 

5 Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys 
130 135 140 

aaa gtg ate age aaa tta gga aat gac agt gee tct gee 46 7 

Lys Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala 
145 150 155 

10 aat ate acc ate gtg gaa tea aac gag ate ate act ggt 506 

Asn lie Thr lie Val Glu Ser Asn Glu lie He Thr Gly 
160 165 

atg cca gec tea act gaa gga gca tat gtg tct tea gag 545 

Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu 

15 170 175 180 

tct ccc att aga ata tea gta tec aca gaa gga gca aat 584 

Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn 
185 190 

act tct tea tct aca tct aca tec acc act ggg aca age 623 

2 0 Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
195 200 205 

cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt 662 

His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
210 215 220 

25 gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea 701 

Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
225 230 

aac ccc teg aga tac ttg tgc aag tgc cca aat gag ttt 740 

Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 

30 235 240 245 

act ggt gat cgc tgc caa aac tac gta atg gec age ttc 779 

Thr Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe 
250 255 

tac aag cat ctt ggg att gaa ttt atg gag gcg gag gag 818 

35 Tyr Lys His Leu Gly He Glu Phe Met Glu Ala Glu Glu 
260 265 270 

ctg tac cag aag aga gtg ctg acc ata acc ggc ate tgc 857 

Leu Tyr Gin Lys Arg Val Leu Thr He Thr Gly He Cys 
275 280 285 

40 ate gec etc ctt gtg gtc ggc ate atg tgt gtg gtg gee 896 

He Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala 
290 295 
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tac tgc aaa 
Tyr Cys Lys 
300 

cgt c'tt egg 
5 Arg Leu Arg 



atg aac att 
Met Asn lie 
325 

10 ccc gag aat 
Pro Glu Asn 
340 

aac gtc ate 
Asn Val lie 

15 

gag aca tec 
Glu Thr Ser 
365 

cat cac tec 
20 His His Ser 



tgg age aac 
Trp Ser Asn 
390 

25 cac tct gta 
His Ser Val 
4 05 

cac age age 
His Ser Ser 

30 

ggc aca gga 
Gly Thr Gly 
430 

cat gee aga 
3 5 His Ala Arg 



cat agt gaa 
His Ser Glu 
455 

40 cgt atg tea 
Arg Met Ser 
470 



acc aag aaa cag egg 
Thr Lys Lys Gin Arg 
305 

cag age ctt egg tct 
Gin Ser Leu Arg Ser 
315 

gec aat ggg cct cac 
Ala Asn Gly Pro His 
330 

gtc cag ctg gtg aat 
Val Gin Leu Val Asn 
345 

tec agt gag cat att 
Ser Ser Glu His lie 
355 

ttt tec acc agt cac 
Phe Ser Thr Ser His 
370 

act act gtc acc cag 
Thr Thr Val Thr Gin 
380 

gga cac act gaa age 
Gly His Thr Glu Ser 
395 

ate gtg atg tea tec 
lie Val Met Ser Ser 
410 

cca act ggg ggc cca 
Pro Thr Gly Gly Pro 
420 . 

ggc cct cgt gaa tgt 
Gly Pro Arg Glu Cys 
435 

gaa acc cct gat tec 
Glu Thr Pro Asp Ser 
445 

agg tat gtg tea gee 
Arg Tyr Val Ser Ala 
460 

cct gta gat ttc cac 
Pro Val Asp Phe His 
475 



aaa aag ctg cat gac 
Lys Lys Leu His Asp 
310 

gaa cga aac aat atg 
Glu Arg Asn Asn Met 
320 

cat cct aac cca ccc 
His Pro Asn Pro Pro 
335 

caa tac gta tct aaa 
Gin Tyr Val Ser Lys 
350 

gtt gag aga gaa gca 
Val Glu Arg Glu Ala 
360 

tat act tec aca gee 
Tyr Thr Ser Thr Ala 
375 

act cct age cac age 
Thr Pro Ser His Ser 
385 

ate ctt tec gaa age 
lie Leu Ser Glu Ser 
400 

gta gaa aac agt agg 
Val Glu Asn Ser Arg 
415 

aga gga cgt ctt aat 
Arg Gly Arg Leu Asn 
425 

aac age ttc etc agg 
Asn Ser Phe Leu Arg 
440 

tac cga gac tct cct 
Tyr Arg Asp Ser Pro 
450 

atg acc acc ccg get 
Met Thr Thr Pro Ala 
465 

acg cca age tec ccc 
Thr Pro Ser Ser Pro 
480 
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aaa teg ccc cct teg gaa atg tct cca ccc gtg tec age 
Lys Sef Pro Pro Ser Glu Met Ser Pro Pro Val Ser Ser 
485 490 

atg acg gtg tec atg cct tec atg gcg gtc age ccc ttc 
5 Met Thr Val Ser Met Pro Ser Met Ala Val Ser Pro Phe 
495 500 505 

atg gaa gaa gag aga cct eta ctt etc gtg aca cca cca 
Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro 
510 515 

10 agg ctg egg gag aag aag ttt gac cat cac cct cag cag 
Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin 
520 525 530 

ttc age tec ttc cac cac aac ccc gcg cat gac agt aac 
Phe Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn 
15 535 540 545 

age etc cct get age ccc ttg agg ata gtg gag gat gag 
Ser Leu Pro Ala Ser Pro Leu Arg lie Val Glu Asp Glu 
550 555 

qag tat gaa acg ace caa gag tac gag cca gee caa gag 
^; Glu Tyr Glu Thr Thr Gin Glu Tyr Glu Pro Ala Gin Glu 
560 565 570 

ccc gtt aag aaa etc gec aat age egg egg gee aaa aga 
Pro Val Lys Lys Leu Ala Asn Ser Arg Arg Ala Lys Arg 
575 580 

25 ace aag ccc aat ggc cac att get aac aga ttg gaa gtg 
Thr Lys Pro Asn Gly His lie Ala Asn Arg Leu Glu Val 
585 590 595 

gac age aac aca age tec cag age agt aac tea gag agt 
Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser Glu Ser 
30 600 605 610 

gaa aca gaa gat gaa aga gta ggt gaa gat acg cct ttc 
Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 
615 620 

ctg ggc ata cag aac ccc ctg gca gee agt ctt gag gca 
35 Leu Gly He Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala 
625 630 635 

aca cct gee ttc cgc ctg get gac age agg act aac cca 
Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro 
640 645 

4 0 gca ggc cgc ttc teg aca cag gaa gaa ate cag gec agg 
Ala Gly Arg Phe Ser Thr Gin Glu Glu He Gin Ala Arg 
650 655 660 
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ctg tct agt gta att get aac caa gac cct att get gta ta 2029 

Leu Ser Ser Val lie Ala Asn Gin Asp Pro lie Ala Val 

665 670 675 

a aacctaaata aacacataga ttcacctgta aaactttatt 2070 

5 ttatataata aagtattcca ccttaaatta aacaatttat tttattttag 2120 

cagttctgea aatagaaaac aggaaaaaaa cttttataaa ttaaatatat 2170 

gtatgtaaaa atgaaaaaaa aaaaaaaaa 2199 

<210> 5 
<211> 637 
10 <212> PRT 

<213> Homo sapiens 

<400> 5 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 

15 10 15 

15 Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 

20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 

35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

20 50 55 60 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 

Glu Leu Asn Arg Lys Asn Lys 1-ro Gin Asn lie Lys lie Gin Lys 

80 85 90 

2 5 Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 

95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 

110 115 120 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 

30 125 130 135 

He Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 

140 145 150 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 

155 160 165 

3 5 Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 

170, 175 180 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 

185 190 195 
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Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
215 220 225 

5 Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 

230 235 240 

Leu Thr He Thr Gly He Cys He Ala Leu Leu Val Val Gly He 
245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
10 260 265 270 

Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 
275 280 285 

Met Met Asn He Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

15 Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val He 

305 310 315 

Ser Ser Glu His He Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 
320 325 330 

Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 
20 335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser He 
350 355 360 

Leu Ser Glu Ser His Ser Val He Val Met Ser Ser Val Glu Asn 
365 370 375 

25 Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 

380 385 390 

Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 
395 400 405 

Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 
30 410 415 420 

Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp 
425 430 435 

Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser 
440 445 450 

3 5 Pro Pro Val Ser Ser Met Thr Val Ser Lys Pro Ser Met Ala Val 

455 460 465 

Ser Pro Phe Met Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro 
470 475 480 
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Pro Arg Leu Arg Glu Lys Lys Phe Asp His His Pro Gin Gin Phe 
485 490 495 

Ser Ser Phe His His Asn Pro Ala His Asp Ser Asn Ser Leu Pro 
500 505 510 

5 Ala Ser Pro Leu Arg lie Val Glu Asp Glu Glu Tyr Glu Thr Thr 

515 520 525 

Gin Glu Tyr Glu Pro Ala Gin Glu Pro Val Lys Lys Leu Ala Asn 
530 535 540 

Ser Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn 
10 545 550 555 

Arg Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn Ser 
560 565 570 

Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe 
575 580 585 

15 Leu Gly lie Gin Asn Pro Leu Ala Ala Ser Leu Glu Ala Thr Pro 

590 595 600 

Ala Phe Arg Leu Ala Asp Ser Arg Thr Asn Pro Ala Gly Arg Phe 
605 610 615 

Ser Thr Gin Glu Glu lie Gin Ala Arg Leu Ser Ser Val lie Ala 

pvisdocio - mo 3$ * j- * 6 2 0 6 2 5 63 0 

Asn Gin Asp Pro lie Ala Val 
635 637 

<210> 6 
<211> 2490 

2 5 <212> DNA 

<213> Home sapiens 

<400> 6 

gtggctgcgg ggcaattgaa aaagagcegg cgaggagttc cccgaaactt 50 

gttggaactc cgggctcgcg eggaggecag gagctgagcg gcggcggctg 100 

3 0 ccggacgatg ggagcgtgag caggaeggtg ataacccctc cccgatcggg 150 

ttgegaggge geegggcaga ggccaggacg cgagccgcca geggegggae 200 

ccatcgacga cttcccgggg cgacaggagc agccccgaga gecagggega 250 

gcgcccgttc caggtggccg gaccgcccgc cgcgtccgcg ccgcgctccc 300 

tgcaggcaac gggagacgee cccgcgcagc gcgagcgcct cagcgcggcc 3 50 

35 gctcgctctc cccatcgagg gacaaacttt tcccaaaccc gatccgagcc 400 

cttggaccaa actcgcctgc gccgagagcc gtccgegtag agcgctccgt 450 
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ctccggcgag atg tec gag cgc aaa gaa ggc aga ggc aaa 4 90 
Met Ser Glu Arg Lys Glu Gly Arg Gly Lys 
15 10 

ggg aag ggc aag aag aag gag cga ggc tec ggc aag aag 529 
5 Gly Lys Gly Lys Lys Lys Glu Arg Gly Ser Gly Lys Lys 

15 20 

ccg gag tec gcg gcg ggc age cag age cca gec ttg cct 568 
Pro Glu Ser Ala Ala Gly Ser Gin Ser Pro Ala Leu Pro 
25 30 35 

10 ccc caa ttg aaa gag atg aaa age cag gaa teg get gca 607 
Pro Gin Leu Lys Glu Met Lys Ser Gin Glu Ser Ala Ala 
40 45 

ggt tec aaa eta gtc ctt egg tgt gaa acc agt tct gaa 646 
Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu 
15 50 55 60 

tac tec tct etc aga ttc aag tgg ttc aag aat ggg aat 685 
Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

gaa ttg aat cga aaa aac aaa cca caa aat ate aag ata 724 
2 0 Glu ueu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 

80 85 

caa aaa aag cca ggg aag tea gaa ctt cgc att aac aaa 76 3 
Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys 
90 95 100 

25 gca tea ctg get gat tct gga gag tat atg tgc aaa gtg 802 
Ala Ser _au Ala Asp Ser Gly Glu Tyr Met Cys Lys Val 
105 110 

ate age aaa tta gga aat gac agt gee tct gee aat ate 841 
lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn lie 
30 115 120 125 

acc ate gtg gaa tea aac gag ate ate act ggt atg cca 880 
Thr lie Val Glu Ser Asn Glu lie lie Thr Gly Met Pro 
130 135 140 

gec tea act gaa gga gca tat gtg tct tea gag tct ccc 919 
35 Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser Pro 

145 150 

att aga ata tea gta tec aca gaa gga gca aat act tct 958 
lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr Ser 
155 160 165 

40 tea tct aca tct aca tec acc act ggg aca age cat ctt 997 
Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu 
170 175 
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gtia aaa tgt gcg gag aag gag aaa act ttc tgt gtg aat 1036 
Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn 
180 185 190 

gga ggg gag tgc ttc atg gtg aaa gac ctt tea aac ccc 1075 
5 Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro 
195 200 205 

teg aga tac ttg tgc aag tgc cca aat gag ttt act ggt 1114 
Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly 
210 215 

10 gat cgc tgc caa aac tac gta atg gec age ttc tac aag 1153 
Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr Lys 
220 225 230 

gcg gag gag ctg tac cag aag aga gtg ctg acc ata ace 1192 
Ala Glu Glu Leu Tyr Gin Lys Arg Val Leu Thr lie Thr 
15 235 240 

ggc ate tgc ate gee etc ctt gtg gtc ggc ate atg tgt 1231 
Gly lie Cys lie Ala Leu Leu Val Val Gly lie Met Cys 
245 f 250 255 

gtg gtg gec tac tgc aaa acc aag aaa cag egg aaa aag 12 70 
20 w'aiValAla Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 

ctg cat gac cgt ctt egg cag age ctt egg tct gaa cga 1309 
-wo saeiiisw? i^eu H i s Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu 

275 280 

25 aac aat atg atg aac att gec aat ggg cct cac cat cct 1348 
Asn Asn Met Met Asn lie Ala Asn Gly Pro His His Pro 
285 290 295 

aac cca ccc ccc gag aat gtc cag ctg gtg aat caa tac 1387 
Asn Pro Pro Pro Glu Asn Val Gin Leu Val Asn Gin Tyr 
30 300 305 

gta tct aaa aac gtc ate tec agt gag cat att gtt gag 1426 
Val Ser Lys Asn Val lie Ser Ser Glu His lie Val Glu 
310 315 320 

aga gaa gca gag aca tec ttt tec acc agt cac tat act 1465 

3 5 Arg Glu Ala Glu Thr Ser Phe Ser Thr Ser His Tyr Thr 

325 330 335 

tec aca gee cat cac tec act act gtc acc cag act cct 1504 
Ser Thr Ala His His Ser Thr Thr Val Thr Gin Thr Pro 
340 345 

4 0 age cac age tgg age aac gga cac act gaa age ate ctt 1543 

Ser His Ser Trp Ser Asn Gly His Thr Glu Ser lie Leu 
350 355 360 
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tec gaa 
Ser Glu 



aac agt 
5 Asn Ser 
375 

cgt ctt 
Arg Leu 



10 ttc etc 
Phe Leu 



gac tct 
Asp Ser 
15 415 

acc ccg 
Thr Pro 



age tec 
20 Ser Ser 
440 

gtg tec 
Val Ser 



25 age ccc 
Ser Pro 



aca cca 
Thr Pro 
30 480 

cct cag 
Pro Gin 



gac agt 
3 5 Asp Ser 
505 

gag gat 
Glu Asp 



4 0 gee caa 
Ala Gin 



age cac tct 
Ser His Ser 
365 

agg cac age 
Arg His Ser 



aat ggc aca 
Asn Gly Thr 
390 

agg cat gec 
Arg His Ala 
405 

cct cat agt 
Pro His Ser 



get cgt atg 
Ala Arg Met 
430 

ccc aaa teg 
Pro Lys Ser 



age atg acg 
Ser Met Thr 
455 

ttc atg gaa 
Phe Met Glu 
470 

cca agg ctg 
Pro Arg Leu 



cag ttc age 
Gin Phe Ser 
495 

aac age etc 
Asn Ser Leu 



gag gag tat 
Glu Glu Tyr 
520 

gag cct gtt 
Glu Pro Val 
535 



gta ate gtg 
Val lie Val 



age cca act 
Ser Pro Thr 
380 

gga ggc cct 
Gly Gly Pro 
395 

aga gaa acc 
Arg Glu Thr 



gaa agg tat 
Glu Arg Tyr 
420 

tea cct gta 
Ser Pro Val 



ccc cct teg 
Pro Pro Ser 
445 

gtg tec aag 
Val Ser Lys 
460 

gaa gag aga 
Glu Glu Arg 



egg gag aag 
Arg Glu Lys 
485 

tec ttc cac 
Ser Phe His 



cct get age 
Pro Ala Ser 
510 

gaa acg acc 
Glu Thr Thr 
525 

aag aaa etc 
Lys Lys Leu 



atg tea tec 
Met Ser Ser 
370 

999 99<= cca 
Gly Gly Pro 
385 

cgt gaa tgt 
Arg Glu Cys 



cct gat tec 
Pro Asp Ser 
410 

gtg tea gec 
Val Ser Ala 



gat ttc cac 
Asp Phe His 
435 

gaa atg tct 
Glu Met Ser 
450 

cct tec atg 
Pro Ser Met 



cct eta ctt 
Pro Leu Leu 
475 

aag ttt gac 
Lys Phe Asp 



cac aac ccc 
His Asn Pro 
500 

ccc ttg agg 
Pro Leu Arg 
515 

caa gag tac 
Gin Glu Tyr 



gec aat age 
Ala Asn Ser 
540 



gta gaa 1582 
Val Glu 



aga gga 1621 
Arg Gly 



aac age 1660 
Asn Ser 
400 

tac cga 16 9 9 
Tyr Arg 



atg acc 1738 
Met Thr 
425 

acg cca 1777 
Thr Pro 



cca ccc 1816 
Pro Pro 



gcg gtc 1855 
Ala Val 
465 

etc gtg 1894 
Leu Val 



cat cac 1933 
His His 
490 

gcg cat 1972 
Ala His 



ata gtg 2011 
He Val 



gag cca 2050 
Glu Pro 
530 

egg egg 208 9 
Arg Arg 



-19- 



BNSOOCID: <WO eS39729A2J_> 



WO 99/39729 PCT7US99/02390 

gcc aaa aga acc aag ccc aat ggc cac att get: aac aga 2128 
Ala Lys Arg Thr Lys Pro Asn Gly His lie Ala Asn Arg 
545 550 555 

ttg gaa gtg gac age aac aca age tec cag age agt aac 2167 
5 Leu Glu Val Asp Ser Asn Thr Ser Ser Gin Ser Ser Asn 
560 565 

tea gag agt gaa aca gaa gat gaa aga gta ggt gaa gat 2206 
Ser Glu Ser Glu Thr Glu Asp Glu Arg Val Gly Glu Asp 
570 575 580 

10 acg cct ttc ctg ggc ata cag aac ccc ctg gca gcc agt 2245 
Thr Pro Phe Leu Gly lie Gin Asn Pro Leu Ala Ala Ser 
585 590 595 

ctt gag gca aca cct gcc ttc cgc ctg get gac age agg 2284 
Leu Glu Ala Thr Pro Ala Phe Arg Leu Ala Asp Ser Arg 
15 600 605 

act aac cca gca ggc cgc ttc teg aca cag gaa gaa ate 2323 
Thr Asn Pro Ala Gly Arg Phe Ser Thr Gin Glu Glu lie 
610 615 620 

cag gcc agg ctg tct agt gta att get aac caa gac cct 2362 
20 Gin Ala Arg Leu Ser Ser Val lie Ala Asn Gin Asp Pro 

625 630 

att get gta taaaaccta aataaacaca tagattcacc tgtaaaactt 2410 
r-M£;»':i:: --mq - ^ Ile Ala Val 
635 637 

25 tattttatat aataaagtat tccaccttaa attaaacaat ttattttatt 2460 

ttagcagttc tgcaaataaa aaaaaaaaaa 24 90 

<210> 7 
<211> 241 
<212> PRT 
30 <213> Homo sapiens 

<400> 7 

Met Ser. Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
15 10 15 

Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
35 20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 
50 55 60 

4 0 Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 

65 70 75 
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Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 
80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

5 Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 

110 115 120 

Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 
125 130 135 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
10 140 145 150 

Glu Ser Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
170 175 180 

15 Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 

185 190 - ■ 195 

Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 
200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
20 215 220 225 

Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro 
230 235 240 

Glu 
24i 

25 <210> 8 

<211> 1715 
<212> DNA 

<213> Homo sapiens 
<400> 8 

3 0 gcgcctgcct ccaacctgcg ggcgggaggt gggtggctgc ggggcaattg 50 
aaaaagagcc ggcgaggagt tccccgaaac ttgttggaac tccgggctcg 100 
cgcggaggcc aggagctgag cggcggcggc tgccggacga tgggagcgtg 150 
agcaggacgg tgataacctc tccccgatcg ggttgcgagg gcgccgggca 2 00 
gaggccagga cgcgagccgc cagcggcggg acccatcgac gacttcccgg 2 50 

3 5 ggcgacagga gcagccccga gagccagggc gagcgcccgt tccaggtggc 3 00 
cggaccgccc gccgcgtccg cgccgcgctc cctgcaggca acgggagacg 350 
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cccccgcgca gcgcgagcgc ctcagcgcgg ccgctcgctc tccccatcga 400 

gggacaaact tttcccaaac ccgatccgag cccttggacc aaactcgcct 450 

gcgccgagag ccgtccgcgt agagcgctcc gtctccggcg ag atg 4 95 

Met 

5 1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 534 
Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 
5 10 

aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 573 
10 Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

gcg ggc age cag age cca gec ttg cct ccc caa ttg aaa 612 
Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

15 gag atg aaa age cag gaa teg yet gca ggt tec aaa eta 651 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

gtc ctt egg tgt gaa acc agt tct gaa tac tec tct etc 690 
Val Leu Arg Cys Glu Thr. Ser Ser Gxu Tyr Ser Ser Leu 
20 55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 72 9 
Bnaocoit • <w\cf .•■-vv'TArg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 

70 75 

aaa aac aaa cca caa aat ate aag ata caa aaa aag cca 76 8 
2 5 Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gca tea ctg get 807 
Gly Lys Ser Glu Leu Arg He Asn Lys Ala Ser Leu Ala 
95 100 105 

30 gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 846 
Asp Ser Gly Glu Tyr Met Cys Lys Val He Ser Lys Leu 
110 115 

gga aat gac agt gec tct gec aat ate acc ate gtg gaa 885 
Gly Asn Asp Ser Ala Ser Ala Asn He Thr He Val Glu 
35 120 125 130 

tea aac gag ate ate act ggt atg cca gec tea act gaa 924 
Ser Asn Glu He lie Thr Gly Met Pro Ala Ser Thr Glu 
135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea 963 
40 Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg He Ser 
145 150 155 
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gta tec aca gaa gga gca aat act tct tea tct aca tct 1002 
Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 
160 165 170 

aca tec ace act ggg aca age cat ctt gta aaa tgt gcg 1041 
5 Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 

175 180 

gag aag gag aaa act ttc tgt gtg aat gga ggg gag tgc 108 0 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 

10 ttc atg gtg aaa gac ctt tea aac ccc teg aga tac ttg 1119 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

tgc aag tgc cca aat gag ttt act ggt gat cgc tgc caa 1158 
Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
15 210 215 220 

aac tac gta atg gee age ttc tac agt acg tec act ccc 1197 
Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro 
225 230 235 

ttt ctg tct ctg cct gaa tagga geatgetcag ttggtgctgc 1240 
2 0 Phe Leu Ser Leu Pro Glu 

240 241 

tttcttgttg ctgcatctcc cctcagattc cacctagagc tagatgtgtc 1290 
ttaccagatc taatattgac tgcctctgcc tgtcgcatga gaacattaac 1340 
aaaagcaatt gtattacttc ctctgttcgc gactagttgg ctctgagata 1390 

25 ctaataggtg tgtgaggctc cggatgtttc tggaattgat attgaatgat 1440 
gtgatacaaa ttgatagtca atatcaagca gtgaaatatg ataataaagg 1490 
catttcaaag tctcactttt attgataaaa taaaaatcat tctactgaac 1540 
agtccatctt ctttatacaa tgaccacatc ctgaaaaggg tgttgctaag 1590 
ctgtaaccga tatgeacttg aaatgatggt aagttaattt tgattcagaa 164 0 

30 tgtgttattt gtcacaaata aacataataa aaggagttca gatgtttttc 1690 

ttcattaacc aaaaaaaaaa aaaaa 1715 

<210> 9 
<211> 420 
<212> PRT 
35 <213> Homo sapiens 

<400> 9 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys 
1 5 10 15 
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Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser 
20 25 30 

Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys Glu Met Lys Ser Gin 
35 40 45 

5 Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser 

50 55 60 

Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly Asn 
65 70 75 

Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys 
10 80 85 90 

Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 

Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu Gly Asn 
110 115 120 

15 Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn Glu lie 

125 130 135 

lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser 
140 145 150 

Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala Asn Thr 
20 155 160 165 

Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val 
170 175 180 

Lys Cys Ala Glu Lys Glu Lys Vhr Phe Cys Val Asn Gly Gly Glu 
185 190 195 

2 5 Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys 

200 205 210 

Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn Tyr Val 
215 220 225 

Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr Gin Lys Arg Val 
30 230 235 240 

Leu Thr lie Thr Gly lie Cys lie Ala Leu Leu Val Val Gly lie 
245 250 255 

Met Cys Val Val Ala Tyr Cys Lys Thr Lys Lys Gin Arg Lys Lys 
260 265 270 

3 5 Leu His Asp Arg Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn 

275 280 285 

Met Met Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 
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Glu Asn Val Gin Leu Val Asn Gin Tyr Val Ser Lys Asn Val lie 
305 310 315 

Ser Ser Glu His lie Val Glu Arg Glu Ala Glu Thr Ser Phe Ser 
320 325 330 

5 Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr 

335 340 345 

Gin Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu Ser lie 
350 355 360 

Leu Ser Glu Ser His Ser Val lie Val Met Ser Ser Val Glu Asn 
10 365 370 375 

Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn 
380 385 390 

Gly Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala 
395 400 405 

1 Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser Glu Arg 

410 415 .420 

• 2 I 0 > 10 

• r 1 1 ^ 2431 
• DNA 

2 0 * 2 1 J • Homo sapiens 

..4 00> 10 

gaggcgcctg cctccaacct gcgggcggga ggtgggtggc tgcggggcaa 5 0 
ttgaaaaaga gccggcgagg agttccccga aacttgttgg aactccgggc 100 
ccgcgcggag gccaggagct gagcggcggc ggctgccgga cgatgggagc 150 

2 5 gcgagcagga cggtgataac ctctccccga tcgggttgcg agggcgccgg 200 

gcagaggcca ggacgcgagc cgccagcggc gggacccatc gacgacttcc 250 

cggggcgaca ggagcagccc cgagagccag ggcgagcgcc cgttccaggt 3 00 

ggccggaccg cccgccgcgt ccgcgccgcg ctccctgcag gcaacgggag 3 50 

acgcccccgc gcagcgcgag cgcctcagcg cggccgctcg ctctccccat 400 

30 cgagggacaa acttttccca aacccgatcc gagcccttgg accaaactcg 450 

cctgcgccga gagccgtccg cgtagagcgc tccgtctccg gcgag atg 4 98 

Met 
1 

tec gag cgc aaa gaa ggc aga ggc aaa ggg aag ggc aag 53 7 

3 5 Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys 

5 10 
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aag aag gag cga ggc tec ggc aag aag ccg gag tec gcg 576 
Lys Lys Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala 
15 20 25 

gcg ggc age cag age cca gec ttg cct ccc caa ttg aaa 615 
5 Ala Gly Ser Gin Ser Pro Ala Leu Pro Pro Gin Leu Lys 
30 35 40 

gag atg aaa age cag gaa teg get gca ggt tec aaa eta 654 
Glu Met Lys Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu 
45 50 

10 gtc ctt egg tgt gaa ace agt tct gaa tac tec tct etc 693 
Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu 
55 60 65 

aga ttc aag tgg ttc aag aat ggg aat gaa ttg aat cga 732 
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg 
15 70 75 

aaa aac aaa cca caa aat ate aag ata caa aaa aag cca 771 
Lys Asn Lys Pro Gin Asn lie Lys lie Gin Lys Lys Pro 
80 85 90 

ggg aag tea gaa ctt cgc att aac aaa gca tea ctg get 810 
2 0 Gly Lys Ser Glu j_.eu Arg lie Asn Lys Ala Ser Leu Ala 
95 100 105 



gat tct gga gag tat atg tgc aaa gtg ate age aaa tta 849 
Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu 
110 115 



25 gga aat gac agt gee tct gee aat ate ace ate gtg gaa 888 
Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu 
120 125 130 

tea aac gag ate ate act ggt atg cca gee tea act gaa 927 
Ser Asn Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu 
30 135 140 

gga gca tat gtg tct tea gag tct ccc att aga ata tea 966 
Gly Ala Tyr Val Ser Ser Glu Ser Pro lie Arg lie Ser 
145 150 155 

gta tec aca gaa gga gca aat act tct tea tct aca tct 1005 

3 5 Val Ser Thr Glu Gly Ala Asn Thr Ser Ser Ser Thr Ser 

160 165 170 

aca tec ace act ggg aca age cat ctt gta aaa tgt gcg 1044 
Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala 
175 180 

4 0 gag aag gag aaa act ttc tgt gtg aat gga ggg gag tgc 108 3 

Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys 
185 190 195 
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ttc atg gtg aaa gac ctt tea aac ccc teg aga tac ttg 1122 
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
200 205 

tgc aag tgt cca aat gag ttt act ggt gat cgc tgc caa 1161 
5 Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin 
210 215 220 

aac tac gta atg gec age ttc tac aag gcg gag gag ctg 1200 
Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu 
225 230 235 

10 tac cag aag aga gtg ctg acc ata acc ggc ate tgc ate 123 9 
Tyr Gin Lys Arg Val Leu Thr lie Thr Gly lie Cys lie 
240 245 

gec etc ctt gtg gtc ggc ate atg tgt gtg gtg gec tac 1278 
Ala Leu Leu Val Val Gly lie Met Cys Val Val Ala Tyr 
15 250, 255 260 

tgc aaa acc aag aaa cag egg aaa aag ctg cat gac cgt 1317 
Cys Lys 'Thr Lys Lys Gin Arg Lys Lys Leu His Asp Arg 
265 270 

ctt egg cag age ctt egg tct gaa cga aac aat atg atg 1356 

2 0 Leu Arg Gin Ser Leu Arg Ser Glu Arg Asn Asn Met Met 

275 280 285 

aac att gec aat ggg cct cac cat cct aac cca ccc ccc 1395 
Asn lie Ala Asn Gly Pro His His Pro Asn Pro Pro Pro 
290 295 300 

25 gag aat gtc cag ctg gtg aat caa tac gta tct aaa aac 1434 
Glu Asn Val Gin xjeu Val Asn Gin Tyr Val Ser Lys Asn 
305 310 

gtc ate tec agt gag cat att gtt gag aga gaa gca gag 1473 
Val lie Ser Ser Glu His lie Val Glu Arg Glu Ala Glu 
30 315 320 325 

aca tec ttt tec acc agt cac tat act tec aca gec cat 1512 
Thr Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His 
330 335 

cac tec act act gtc acc cag act cct age cac age tgg 1551 

3 5 His Ser Thr Thr Val Thr Gin Thr Pro Ser His Ser Trp 

340 345 350 

age aac gga cac act gaa age ate ctt tec gaa age cac 1590 
Ser Asn Gly His Thr Glu Ser lie Leu Ser Glu Ser His 
355 360 365 

40 tct gta ate gtg atg tea tec gta gaa aac agt agg cac 1629 
Ser Val lie Val Met Ser Ser Val Glu Asn Ser Arg His 
370 375 
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age age cca act ggg ggc cca aga gga cgt ctt aat ggc 1668 
Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly 
380 385 390 

aca gga ggc cct cgt gaa tgt aac age ttc etc agg cat 1707 
Thr Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His 
395 400 

gee aga gaa ace cct gat tec tac cga gac tct cct cat 1746 
Ala Arg Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His 
405 410 415 

agt gaa agg taaaa ccgaaggcaa agetactgea gaggagaaac 1790 
Ser Glu Arg 
420 

tcagtcagag aatccctgtg agcacctgcg gtctcacctc aggaaatcta 1840 

ctctaatcag aataaggggc ggcagttacc tgttctagga gtgetcctag 1890 

ttgatgaagt catctctttg tttgacggaa cttatttctt ctgagcttct 1940 

ctcgtcgtcc cagtgactga caggcaacag actcttaaag agctgggatg 1990 

etttgatgeg gaaggtgcag cacatggagt ttccagctct ggccatgggc 2040 

tcagacccac teggggtetc agtgtcctca gttgtaacat tagagagatg 2090 

geatcaatge ttgataagga cccttctata attccaattg ccagttatcc 2140 

aaactctgat tcggtggtcg agctggcctc gtgttcttat ctgctaaccc 2190 

tgtcttacct tccagcctca gttaagtcaa atcaagggct atgtcattgc 2240 

tgaatgtcat ggggggcaac tgcttgccct ccaccctata gtatctattt 2290 

tatgaaattc caagaaggga tgaataaata aatctcttgg atgctgcgtc 2340 

tggcagtctt cacgggtggt tttcaaagca gaaaaaaaaa aaaaaaaaaa 2390 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa a 2431 

<210> 11 
<211> 768 
<212> PRT 

<213> Homo sapiens 
<400> 11 

Met Asp Val Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg 
15 10 15 

Arg Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser Glu 
20 25 30 

Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser Glu Thr Leu 
35 40 45 
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Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala Tyr Gly Ser Arg Val 
50 55 60 

Lys Asp lie Val Pro Gin Glu Ala Glu Glu Phe Cys Arg Thr Gly 
65 70 75 

5 Ala Asn Phe Thr Leu Arg Glu Leu Gly Leu Glu Glu Val Thr Pro 

80 85 90 

Pro His Gly Thr Leu Tyr Arg Thr Asp lie Gly Leu Pro His Cys 
95 100 105 

Gly Tyr Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
10 110 115 120 

Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly Arg Ser 
125 130 135 

Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser Arg Ala Asn Ser 
140 145 150 

15 Asn Leu Thr Leu Thr Asp Thr Glu His Glu Asn Thr Glu Thr Asp 

155 160 y 165 

His Pro Gly Gly Leu Gin Asn His Ala Arg Leu Arg Thr Pro Pro 
170 175 180 

Pro Pro Leu Ser His Ala His Thr Pro Asn Gin His His Ala Ala 
20 185 190 195 

Ser lie Asn Ser Leu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn 
200 205 210 

Pro Ser Vro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro Pro 
215 220 225 

2 5 Ale Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu Asn Trp Leu 

230 235 240 

Leu Asn Ser Asn lie Pro Leu Glu Thr Arg Asn Leu Gly Lys Gin 
245 250 255 

Pro Phe Leu Gly Thr Leu Gin Asp Asn Leu lie Glu Met Asp lie 
30 260 265 270 

Leu Gly Ala Ser Arg His Asp Gly Ala Tyr Ser Asp Gly His Phe 
275 280 285 



Leu Phe Lys Pro Gly 
290 



Arg Pro Leu Pro Arg 
320 



Gly Thr Ser Pro Leu 
295 



Ser Thr Phe Ala Arg 
325 



Phe Cys Thr Thr Ser 
300 

Tyr Ser Pro Pro Pro 
.315 

Pro Ala Phe Asn Leu 
330 



Pro Gly Tyr Pro Leu Thr Ser Ser Thr Val 
305 310 
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Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys Ala Ala Leu Ser 
335 340 345 

Ala lie Val lie Ser Ala Thr Leu Val lie Leu Leu Ala Tyr Phe 
350 355 360 

Val Ala Met His Leu Phe Gly Leu Asn Trp His Leu Gin Pro Met 
365 370 375 

Glu Gly Gin Met Tyr Glu lie Thr Glu Asp Thr Ala Ser Ser Trp 
380 385 390 

Pro Val Pro Thr Asp Val Ser Leu Tyr Pro Ser Gly Gly Thr Gly 
395 400 405 

Leu Glu Thr Pro Asp Arg Lys Gly Lys Gly Thr Thr Glu Gly Lys 
410 415 420 

Pro Ser Ser Phe Phe Pro Glu Asp Ser Phe lie Asp Ser Gly Glu 
425 430 435 

lie Asp Val Gly Arg Arg Ala Ser Gin Lys lie Pro Pro Gly Thr 
440 445 450 

Phe Trp Arg Ser Gin Val Phe lie Asp His Pro Val His Leu Lys 
455 460 465 

Phe Asn Val Ser Leu Gly Lys Ala Ala Leu Val Gly lie Tyr Gly 
470 475 480 

Arg Lys Gly Leu Pro Pro Ser His Thr Gin Phe Asp Phe Val Glu 
485 490 495 

Leu _eu Asp Gly Arg Arg Leu Leu Thr Gin Glu Ala Arg Ser Leu 
500 505 510 

Glu Gly Thr Pro Arg Gin Ser Arg Gly Thr Val Pro Pro Ser Ser 
515 520 525 

His Glu Thr Gly Phe lie Gin Tyr Leu Asp Ser Gly lie Trp His 
530 535 540 

Leu Ala Phe Tyr Asn Asp Gly Lys Glu Ser Glu Val Val Ser Phe 
545 550 555 

Leu Thr Thr Ala lie Ala Leu Pro Pro Arg Leu Lys Glu Met Lys 
560 565 570 

Ser Gin Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys Glu 
575 580 585 

Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn 
590 595 600 



Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys lie 
605 610 615 



WO 99/39729 PCT/US99/02390 

Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn Lys Ala Ser 
620 625 630 

Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val lie Ser Lys Leu - 
635 640 645 

5 Gly Asn Asp Ser Ala Ser Ala Asn lie Thr lie Val Glu Ser Asn 

650 655 660 

Glu lie lie Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr Val 
665 670 675 

Ser Ser Glu Ser Pro lie Arg lie Ser Val Ser Thr Glu Gly Ala 
10 680 685 690 

Asn Thr Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
695 700 705 

Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly 
710 715 720 

15 Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr 

725 - 730 ... 735 

Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gin Asn 
740 745 750 

Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser 
20 755 760 765 

Leu Pro Glu 
768 

<-:io> 12 

<211> 3111 
25 <212> DNA 

<213> Homo sapiens 

<400> 12 

gggtaccatg ggtcggtgag cgcgtttccc gcctgagcgc aactagcggc 50 

gggtcgtggg cacctccaga aaagatcccg caccatcctc caggatccaa 100 

30 tggccttgga gagagggctg cagggcccac ggacattgct gactcttcag 150 

aacgtgctga catggagcca ggtagactga aattatcatg tgtccaaatt 2 00 

aaaattgcat acttcaagga ttatttgaag gactattctt agaccctttt 250 

aagaagattt aaagaaaaac cactcggccc tgagtgcggc gaggaccctg 300 

tttgtggatg tggaggagcg cgggccggag gcc atg gac gtg 34 2 
3 5 Met Asp Val 

1 

aag gag agg aag cct tac cgc teg ctg acc egg cgc cgc 381 
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Lys Glu Arg Lys Pro Tyr Arg Ser Leu Thr Arg Arg Arg 
5 10 15 

gac gcc gag cgc cgc tac acc age teg tec gcg gac age 420 
Asp Ala Glu Arg Arg Tyr Thr Ser Ser Ser Ala Asp Ser 
5 20 25 

gag gag ggc aaa gcc ccg cag aaa teg tac age tec age 4 59 
Glu Glu Gly Lys Ala Pro Gin Lys Ser Tyr Ser Ser Ser 
30 35 40 

gag acc ctg aag gcc tac gac cag gac gcc cgc eta gcc 498 
10 Glu Thr Leu Lys Ala Tyr Asp Gin Asp Ala Arg Leu Ala 
45 50 55 

tat ggc age cgc gtc aag gac att gtg ccg cag gag gcc 53 7 
Tyr Gly Ser Arg Val Lys Asp He Val Pro Gin Glu Ala 
60 65 

15 gag gaa ttc tgc cgc aca ggt gcc aac ttc acc ctg egg 576 
Glu Glu Phe Cys Arg Thr Gly Ala Asn Phe Thr Leu Arg 
70 75 80 

gag ctg ggg ctg gaa gaa gta acg ccc cct cac ggg acc 615 
Glu Leu Gly Leu Glu Glu Val Thr Pro Pro His Gly Thr 
20 85 90 

ctg tac egg aca gac att ggc etc ccc cac tgc ggc tac 654 
Leu Tyr Arg Thr Asp He Gly Leu Pro His Cys Gly Tyr 

5M»DOCiO' ^MCT 'ta^.: .-;*.. -95 100 105 

tec atg ggg get ggc tct gat gcc gac atg gag get gac 693 
25 Ser Met Gly Ala Gly Ser Asp Ala Asp Met Glu Ala Asp 
110 115 120 

acg gtg ctg tec cct gag cac ccc gtg cgt ctg tgg ggc 732 
Thr Val Leu Ser Pro Glu His Pro Val Arg Leu Trp Gly 
125 130 

3 0 egg age aca egg tea ggg cgc age tec tgc ctg tec age 771 

Arg Ser Thr Arg Ser Gly Arg Ser Ser Cys Leu Ser Ser 
135 140 145 

c gg gcc aat tec aat etc aca etc acc gac acc gag cat 810 
Arg Ala Asn Ser Asn Leu Thr Leu Thr Asp Thr Glu His 
35 150 155 

gaa aac act gag act gat cat ccg ggc ggc ctg cag aac 84 9 
Glu Asn Thr Glu Thr Asp His Pro Gly Gly Leu Gin Asn 
160 165 170 

cac gcg egg etc egg acg ccg ccg ccg ccg etc teg cac 888 

4 0 His Ala Arg Leu Arg Thr Pro Pro Pro Pro Leu Ser His 

175 180 185 

gcc cac acc ccc aac cag cac cac gcg gcc tec att aac 927 
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Ala His Thr Pro Asn Gin His His Ala Ala Ser lie Asn 
190 195 

tec ctg aac egg ggc aac ttc acg ccg agg age aac ccc 966 
Ser Leu Asn Arg Gly Asn Phe Thr Pro Arg Ser Asn Pro 
200 205 210 



age ccg gee ccc acg gac cac teg etc tec gga gag ccc 1005 
Ser Pro Ala Pro Thr Asp His Ser Leu Ser Gly Glu Pro 
215 220 



cct gee ggc ggc gec cag gag cct gec cac gec cag gag 1044 
10 Pro Ala Gly Gly Ala Gin Glu Pro Ala His Ala Gin Glu 
225 230 235 



aac tgg ctg etc aac age aac ate ccc ctg gag ace aga 1083 
Asn Trp Leu Leu Asn Ser Asn lie Pro Leu Glu Thr Arg 
240 245 250 



15 aac eta 
Asn Leu 



aac etc 
Asn Leu 
20 265 

gat ggg 
Asp Gly 



gga ggc 
25 Gly Gly 
290 

tac cca 
Tyr Pro 



ggc aag cag 
Gly Lys Gin 
255 

att gag atg 
lie Glu Met 



get tac agt 
Ala Tyr Ser 
280 

ace tec ccg 
Thr Ser Pro 



ctg acg tec 
Leu Thr Ser 
305 



cca ttc eta 
Pro Phe Leu 



gac att etc 
Asp lie Leu 
270 

gac ggg cac 
Asp Gly His 



etc ttc tgc 
Leu Phe Cys 
295 

age aca gtg 
Ser Thr Val 
310 



ggg aca ttg 
Gly Thr Leu 
260 

ggc gee tec 
Gly Ala Ser 



ttc etc ttc 
Phe Leu Phe 
285 

ace aca tea 
Thr Thr Ser 
300 

tac tct cct 
Tyr Ser Pro 



cag gac 1122 
Gin Asp 



cgc cat 1161 
Arg His 
275 

aag cct 1200 
Lys Pro 



cca ggg 123 9 
Pro Gly 

ccg ccc 127 8 
Pro Pro 
315 



30 cga ccc ctg ccc cgc age ace ttc gee egg ccg gec ttt 1317 
Arg Pro Leu Pro Arg Ser Thr Phe Ala Arg Pro Ala Phe 
320 325 



aac etc aag aag ccc tec aag tac tgt aac tgg aag tgc 1356 
Asn Leu Lys Lys Pro Ser Lys Tyr Cys Asn Trp Lys Cys 
35 330 335 340 



gca gee ctg age gec ate gtc ate tea gee act ctg gtc 13 95 
Ala Ala Leu Ser Ala lie Val lie Ser Ala Thr Leu Val 
345 350 



ate ctg ctg gca tac ttt gtg gee atg cac ctg ttt ggc 1434 
4 0 lie Leu Leu Ala Tyr Phe Val Ala Met His Leu Phe Gly 
355 360 365 



eta aac tgg cac ctg cag ccg atg gag ggg cag atg tat 14 7 3 
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Leu Asn Trp His Leu Gin Pro Met Glu Gly Gin Met Tyr 
370 375 380 

gag ate acg gag gac aca gec age agt tgg cct gtg cca 1512 
Glu lie Thr Glu Asp Thr Ala Ser Ser Trp Pro Val Pro 
5 385 390 

acc gac gtc tec eta tac ccc tea ggg ggc act ggc tta 1551 
Thr Asp Val Ser Leu Tyr Pro Ser Gly Gly Thr Gly Leu 
395 400 405 

gag acc cct gac agg aaa ggc aaa gga acc aca gaa gga 15 90 
10 Glu Thr Pro Asp Arg Lys Gly Lys Gly Thr Thr Glu Gly 
410 415 

aag ccc agt agt ttc ttt cca gag gac agt ttc ata gat 1629 
Lys Pro Ser Ser Phe Phe Pro Glu Asp Ser Phe lie Asp 
420 425 430 

15 tct gga gaa att gat gtg gga agg cga get tec cag aag 1668 
Ser Gly Glu lie Asp Val Gly Arg Arg Ala Ser Gin Lys 
435 440 445 

att cct cct ggc act ttc tgg aga tct caa gtg ttc ata 1707 
lie Pro Pro Gly Thr Phe Trp Arg Ser Gin Val Phe lie 
20 450 455 

gac cat cct gtg cat ctg aaa ttc aat gtg tct ctg gga 1746 
Asp His Pro Val His Leu Lys Phe Asn Val Ser Leu Gly 
460 465 470 

aag gca gec ctg gtt ggc att tat ggc aga aaa ggc etc 178 5 
25 Lys Ala Ala Leu Val Gly lie Tyr Gly Arg Lys Gly Leu 
475 480 

cct cct tea cat aca cag ttt gac ttt gtg gag ctg ctg 1824 
Pro Pro Ser His Thr Gin Phe Asp Phe Va) Glu Leu Leu 
485 490 495 

30 gat ggc agg agg etc eta acc cag gag gcg egg age eta 1863 
Asp Gly Arg Arg Leu Leu Thr Gin Glu Ala Arg Ser Leu 
500 505 510 

gag ggg acc ccg cgc cag tct egg gga act gtg ccc ccc 1902 
Glu Gly Thr Pro Arg Gin Ser Arg Gly Thr Val Pro Pro 
35 515 520 

tec age cat gag aca ggc ttc ate cag tat ttg gat tea 1941 
Ser Ser His Glu Thr Gly Phe lie Gin Tyr Leu Asp Ser 
525 530 535 

gga ate tgg cac ttg get ttt tac aat gac gga aag gag 1980 
40 Gly lie Trp His Leu Ala Phe Tyr Asn Asp Gly Lys Glu 

540 „ 545 

tea gaa gtg gtt tec ttt etc acc act gec att gec ttg 2019 
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Ser Glu Val Val Ser Phe Leu Thr Thr Ala lie Ala Leu 
550 555 560 

cct ccc cga ttg aaa gag atg aaa age cag gaa teg get 2058 
Pro Pro Arg Leu Lys Glu Met Lys Ser Gin Glu Ser Ala 
5 565 570 575 

gca ggt tec aaa eta gtc ctt egg tgt gaa ace agt tct 2097 
Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser 
580 585 

gaa tac tec tct etc aga ttc aag tgg ttc aag aat ggg 2136 
10 Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn Gly 
590 595 600 

aat gaa ttg aat cga aaa aac aaa cca caa aat ate aag 2175 
Asn Glu Leu Asn Arg Lys Asn Lys Pro Gin Asn lie Lys 
605 610 

15 ata caa aaa aag cca ggg aag tea gaa ctt cgc att aac 2214 
lie Gin Lys Lys Pro Gly Lys Ser Glu Leu Arg lie Asn 
615 620 625 

aaa gca tea ctg get gat tct gga gag tat atg tgc aaa 2253 
Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys 
20 630 635 640 

gtg ate age aaa tta gga aat gac agt gee tct gee aat 2292 
Val lie Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn 
645 650 

ate ace ate gtg gaa tea aac gag ate ate act ggt atg 2331 
25 He Thr He Val Glu Ser Asn Glu He He Thr Gly Met 
655 . 660 665 

cca gec tea act gaa gga gca tat gtg tct tea gag tct 2370 
Pro Ala Ser Thr Glu Gly Ala Tyr Val Ser Ser Glu Ser 
670 675 

30 ccc att aga ata tea gta tec aca gaa gga gca aat act 2409 
Pro He Arg He Ser Val Ser Thr Glu Gly Ala Asn Thr 
680 , 685 690 

tct tea tct aca tct aca tec ace act ggg aca age cat 2448 
Ser Ser Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser His 
35 695 700 705 

ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt gtg 2487 
Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val 
710 715 

aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea aac 2526 
4 0 Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn 
720 725 730 

ccc teg aga tac ttg tgc aag tgc cca aat gag ttt act 2565 
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Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr 
735 740 

ggt gat cgc tgc caa aac tac gta atg gcc age ttc tac 2604 
Gly Asp Arg Cys Gin Asn Tyr Val Met Ala Ser Phe Tyr 
745 750 755 

agt acg tec act ccc ttt ctg tct ctg cct gaa tag 2640 
Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
760 765 768 

gagcatgetc agttggtgct gctttcttgt tgetgeatet cccctcagat 2690 

tccacctaga gctagatgtg tcttaccaga tctaatattg actgcctctg 2740 

cctgtcgcat gagaacatta acaaaagcaa ttgtattact tcctctgttc 2790 

gcgactagtt ggctctgaga tactaatagg tgtgtgaggc tccggatgtt 284 0 

tctggaattg atattgaatg atgtgataca aattgatagt caatatcaag 2890 

cagtgaaata tgataataaa ggcatttcaa agtctcactt ttattgataa 2940 

aataaaaatc attctactga acagtccatc ttctttatac aatgaccaca 2990 

tcctgaaaag ggtgttgcta agctgtaacc gatatgeact tgaaatgatg 304 0 

gtaagttaat tttgattcag aatgtgttat ttgtcacaaa taaacataat 3090 

r aaaaggaaaa aaaaaaaaaa a 3111 

<210> 13 

<211> 1872 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gaattcggga cagcctctcc tgccgccgct gctgctgccg ccgccgccac 50 
cgccggctgg tcctccttct gcttttactt ctcctgcatg acagttgttt 100 
tcttcatctg agcagacacc agcttcagat gctcgaggtg agaaacatgc 150 
ctttcagttt gggctactgg tttacttaat taatcagccg gcagctccgt 200 
cgatctattt tcgtccctgt cctcttgacg agecegggat ggtttggagt 250 
agcatttaaa agaactagaa aagtggccca gaaacagcag cttaaagaat 300 
tattacgata tactttgatt ttgtagttgc taggagcttt tcttcccccc 350 
ttgcatcttt ctgaactctt cttgatttta ataatggect tggacttgga 400 
cgatttatcg atttccccct gtaagatget gtatcatttg gttggggggg 450 
cctctgcgtg gtaatggacc gtgagagegg ccaggccttc ttctggaggt 500 
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gagccg 



gcc gcc 
. 5 Ala Ala 



gca gac 
Ala Asp 
25 

10 atg cca 
Met Pro 



etc gtg 
Leu Val 

15 

get gag 
Ala Glu 
65 

lac att 
2f 'jyr- He 



etc tgc 
Leu Cys 
90 

25 ate ttt 
He Phe 



ggg tta 
Gly Leu 

30 

get gcc 
Ala Ala 
130 

tea cct 
3 5 Ser Pro 



gtt aca 
Val Thr 
155 

40 cca tea 
Pro Ser 



atg gag att 
Met Glu He 
l 

gag agg tec 
Glu Arg Ser 
15 

cca tct ctt 
Pro Ser Leu 



gag ccc cag 
Glu Pro Gin 
40 

ggc ctg gcc 
Gly Leu Ala 
55 

cgc ctg aga 
Arg Leu Arg 



gtc ccc ate 
Val Pro He 
80 

ate gcc ggc 
He Ala Gly 



gaa tat gac 
Glu Tyr Asp 
105 

ggc cag gac 
Gly Gin Asp 
120 

tea get gtg 
Ser Ala Val 



gtc tct agg 
Val Ser Arg 
145 

gtg caa ggt 
Val Gin Gly 



gcg gca ccg 
Ala Ala Pro 
170 



tat tec cca 
Tyr Ser Pro 
5 

tec age ccc 
Ser Ser Pro 



gat ggg ctt 
Asp Gly Leu 
30 

act gaa gat 
Thr Glu Asp 
45 

gtg ccc tgc 
Val Pro Cys 



ggt tgc etc 
Gly Cys Leu 
70 

ctg get tgc 
Leu Ala Cys 



etc aag tgg 
Leu Lys Trp 
95 

tct cct act 
Sei Pro Thr 
110 

cct att att 
Pro He He 



tgg gtg teg 
Trp Val Ser 
135 

get caa tct 
Ala Gin Ser 



gac aag get 
Asp Lys Ala 
160 

aca ccg aag 
Thr Pro Lys 
175 



gac atg tec 
Asp Met Ser 



tec act cag 
Ser Thr Gin 
20 

ccg gca gca 
Pro Ala Ala 
35 

ggg aga ace 
Gly Arg Thr 



tgt gcg tgc 
Cys Ala Cys 
60 

aac tea gag 
Asn Ser Glu 



ctg gtc age 
Leu Val Ser 
85 

gta ttt gtg 
Val Phe Val 
100 

cac ctt gac 
His Leu Asp 



tct ctg gac 
Ser Leu Asp 
125 

tct gag gca 
Ser Glu Ala 



gaa agt gag 
Glu Ser Glu 
150 

gtt gtc tec 
Val Val Ser 
165 

aat cgt att 
Asn Arg He 



gag gtc 539 
Glu Val 
10 

ctg agt 578 
Leu Ser 



gaa gac 617 
Glu Asp 



cct gga 6 56 
Pro Gly 
50 

eta gaa 695 
Leu Glu 



aaa ate 734 
Lys He 

75. 

etc tgc 773 
Leu Cys 



gac aag 812 
Asp Lys 



cct ggg 8 51 
Pro Gly 
115 

gca act 890 
Ala Thr 



tac act 929 
Tyr Thr 
140 

gtr caa 968 
Val Gin 



ttt gaa 1007 
Phe Glu 



ttt gcc 1046 
Phe Ala 
180 



-37- 



BNSDOCIO; <WO 9939729A2J„> 



4 



WO 99/39729 PCT/US99/02390 

ttt tct ttc ttg ccg tec act gcg cca tec ttc cct tea 1085 
Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser 
185 190 

ccc ace egg aac cct gag gtg aga acg ccc aag tea gca 1124 
5 Pro Thr Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala 
195 200 205 

act cag cca caa aca aca gaa act aat etc caa act get 1163 
Thr Gin Pro Gin Thr Thr Glu Thr Asn Leu Gin Thr Ala 
210 215 

10 cct aaa ctt tct aca tct aca tec acc act ggg aca age 1202 
Pro Lys Leu Ser Thr Ser Thr Ser Thr Thr Gly Thr Ser 
220 225 230 

cat ctt gta aaa tgt gcg gag aag gag aaa act ttc tgt 1241 
His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys 
15 235 240 245 

gtg aat gga ggg gag tgc ttc atg gtg aaa gac ctt tea 1280 
Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser 
250 255 

aac ccc teg aga tac ttg tgc aag tgc cca aat gag ttt 1319 
2 0 Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe 
260 265 270 

act ggt gat cgc tgc caa aac tac gta atg gee age ttc 1358 

Ry;^X>^- - av;> ' a;. . Thr Gly Asp Cys Qln Asn Tyr Val Met Ala Ser phe 

275 280 

25 tac agt acg tec act ccc ttt ctg tct ctg cct gaa taggag 1400 
Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
285 290 295 296 

catgetcagt tggtgctgct ttcttgttgc tgcatctccc ctcagattcc 1450 

acctagagct agatgtgtct taccagatct aatattgact gcctctgcct 1500 

30 gtegcatgag aacattaaca aaagcaattg tattacttcc tctgttcgcg 1550 
actagttggc tctgagatac taataggtgt gtgaggctcc ggatgtttct 1600 
ggaattgata ttgaatgatg tgatacaaat tgatagtcaa tatcaagcag 1650 
tgaaatatga taataaaggc atttcaaagt ctcactttta ttgataaaat 1700 
aaaaatcatt ctactgaaca gtccatcttc tttatacaat gaccacatcc 1750 

35 tgaaaagggt gttgetaage tgtaaccgat atgcacttga aatgatggta 1800 
agttaatttt gattcagaat gtgttatttg tcacaaataa acataataaa 1850 
aggaaaaaaa aaacccgaat tc 1872 
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<210> 14 
<211> 296 
<212> PRT 

<213> Homo sapiens 
5 <400> 14 

Met Glu lie Tyr Ser Pro Asp Met Ser Glu Val Ala Ala Glu Arg 
15 10 15 

Ser Ser Ser Pro Ser Thr Gin Leu Ser Ala Asp Pro Ser Leu Asp 
20 25 30 

10 Gly Leu Pro Ala Ala Glu Asp Met Pro Glu Pro Gin Thr Glu Asp 

35 40 45 

Gly Arg Thr Pro Gly Leu Val Gly Leu Ala Val Pro Cys Cys Ala 
50 55 60 

Cys Leu Glu Ala Glu Arg Leu Arg Gly Cys Leu Asn Ser Glu Lys 
15 65 70 75 

lie Cys lie Val Pro lie Leu Ala Cys Leu Val Ser Leu Cys Leu 
80 85 90 

Cys lie Ala Gly Leu Lys Trp Val Phe Val Asp Lys He Phe Glu 
95 100 105 

2 0 Tyr Asp Ser Pro Thr His Leu Asp Pro Gly Gly Leu Gly Gin Asp 

110 115 120 

Pro He He Ser Leu Asp Ala Thr Ala Ala Ser Ala Val Trp Val 
125 130 135 

Ser Ser Glu Ala Tyr Thr Ser Pro Val Ser Arg Ala Gin Ser Glu 
25 140 145 150 

Ser Glu Val Gin Val Thr Val Gin Gly Asp Lys Ala Val Val Ser 
155 160 165 

Phe Glu Pro Ser Ala Ala Pro Thr Pro Lys Asn Arg He Phe Ala 
170 175 180 

30 Phe Ser Phe Leu Pro Ser Thr Ala Pro Ser Phe Pro Ser Pro Thr 

185 190 195 

Arg Asn Pro Glu Val Arg Thr Pro Lys Ser Ala Thr Gin Pro Gin 
200 205 210 

Thr Thr Glu Thr Asn Leu Gin Thr Ala Pro Lys Leu Ser Thr Ser 
35 215 220 225 

Thr Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys 
230 235 240 

Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys 
245 250 255 
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Asp Leu Ser Asn Pro 
260 

Phe Thr Gly Asp Arg 
275 



Ser Arg Tyr Leu Cys 
265 

Cys Gin Asn Tyr Val 
280 



Lys Cys Pro Asn Glu 
270 

Met Ala Ser Phe Tyr 
285 



Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu 
290 295 296 
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